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This invcntiOT relates lo polynu- 
clclotides encoding Glycoprotein B from 
the RFHV/KSHV subfamily of gamma 
herpes viruses, three members of which 
are characterized in detail. DNA extracts 
were obtained from Macaque nemestrina 
and Macaque muiatta monkeys affected 
with retroperitoneal fibromatosis (RF). and 
human AIDS patients affected with Kaposi's 
sarcoma (KS). The extracts were amplified 
using consensus -dcgencraic oligonucleotide 
probes designed from icnown procein and 
DNA sequences of gamma herpes viruses. 
The nucleotide sequences of a 319 base pair 
fragment are about 76 % identical between 
RFHVl and KSHV. and about 60-63 % 
identical with the closest related gamma 
herpes vimses outside the RFHV/KSHV 
subfamily. Protein sequences encoded within 
these fragments are about 91 % identical 
between RFHVl and KSHV. and < - 65 
% identical to that of other gamma herpes 
viruses. The full-length KSHV Glycoprotein 
B sequence comprises a transmembrane 
domain near the N-tcrminus, and a 
plurality of potentially antigenic sites in the 
extracellular domain. Materials and methods 
are provided to characterize Glycoprotein 
B encoding regions of members of the 

RFHV/KSHV subfamily, including but not ^ a 

limited to RFHVl. RFHV2. and KSHV Peptides, polynucleotides, and antibodies of this mvcnuon can be used 
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for diagnosing infection. 



and for eliciting an immune response against Glycoprotein B. 
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glycoprotpim b of the r fhv/kshv Subfamily qf Herpes Viruses 



Technical Field 

The present invention relates generally to the field of virology, particularly viruses of the herpes 
family. More specifically, it relates to the IdentHication and characterization of herpes vims Glycoprotein B 
molecules which are associated with fibre proliferBtive and neoplastic conditions in primates, including 
humans. 



RanKCROUND ART 



Kaposi's Sarcoma is a disfiguring and potentially fata! form of hemorrhagic sarcoma. It is 
characterized by multiple vascular tumors that appear on the skin as daricly colored plaques or nodules. At 
the histological level, it is characterized by proliferation of relatively unrform spindle-shaped cells, forming 
fascicles and vascular slits. There is often evidence of plasma cells. T cells and monocytes In the 
inflammatory infiltrate. Death may ultimately ensue due to bleeding from gastrointestinal lesions or from an 
associated lymphoma. (See generally Martin et al.. Finesmith et al.) 

Once a relatively obscure disease, it has leapt to public attention due to its association with AIDS. 
As many as 20% of certain AIDS-affected populations acquire Kaposi's during the course of the disease. 
Kaposi's Sarcoma occurs in other conditions associated with immunodeficiency, including kidney dialysis 
and therapeutic immunosuppression. However, the epidemiology of the disease has suggested that 
immunodeficiency is not the only causative factor. In particular, the high degree of association of Kaposi's 
with certain sexual practices suggests the involvement of an etiotogic agent which is not the human 
immunodeficiency virus {Berel et al ). 

A herpes-virus-like DMA sequence has been identified in tissue samples from Kaposi's lesions 
obtained from AIDS patients (Chang et al.. confirmed by Ambroziuk et al.). The sequence was obtained by 
representational difference analysis (Lisitsyn et al.). in which D^4A from affected and unaffected tissue 
were amplified using unrelated priming oligonucleotides, and then hybridized together to highlight 
differences between the cells. The sequence was partly identical to known sequences of the Epstein Ban- 
Vinjs and herpesviois saimiri. It coded for capsid and tegument proteins, two structural components 
sequestered In the viral interior. In a survey of tissues from various sources, the sequence was found in 
95% of Kaposi's sarcoma lesions, regardless of the patients' HIV status (Moore et al. 1995a). 21% of 
uninvolved tissue from the same patients was positive, while 5% of samples from a control population was 
positive. There was approximately 0.5% sequence variation between samples 

The same sequence has been detected in body cavity lymphoma, a lymphomatous effijsion with 
B^II features, occurring uniquely in AIDS patients (Cesarman et al.). The copy number was higher in 
body cavity lymphoma, compared with Kaposi's Sarcoma. Other AIDS-assodated lymphomas were 
negative. The sequence has also t>een found in peripheral Wood mononuclear cells of patients with 
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Castleman's disease (Dupin et al.). This is a condition characterized by morphologic features of 
angiofolicular hyperplasia, and associated with fever, adenopathy, and splenorDegaly. The putative virus 
from which the sequence is derived has twcome known as Kaposi's Sarcoma associated Herpes Virus 
(KSHV). 

5 Using PCR in situ hytwidization. Boshoff et al. have detected KSHV polynucleotide sequences in 

the cell types thought to represent neoplastic cells in Kaposi's sarcoma. Serological evidence supports an 
important role for KSHV in the etiology of Kaposi's sarcoma (O'Leary). Kedes et al. developed an 
immunofluorescence serological assay that detects antibody to a latency-associated nuclear antigen in B 
cells latently infected with KSHV. and found that KSHV seropositivity Is high in patients with Kaposi's 

10 sarcoma. Gao et al. found that of 40 patients with Kaposi's sarcoma. 32 were positive for antibodies 
against KSHV antigens by an immunoWot assay, as compared with only 7 of 40 homosexual men without 
Kaposi's sarcoma immediately before the onset of AIDS. Miller et al. prepared KSHV antigens from a 
body cavity lymphoma celt line containing the genomes of both KSHV and Epstein-Ban- virus. Antibodies 
to one antigen, designated p40. were identified in 32 of 48 HIV-1 infected patients with Kaposi's sarcoma. 

15 as compared with only 7 of 54 HIV-1 infected patients without Kaposi's sarcoma. 

Zhong et al. analyzed the expression of KSHV sequences in affected tissue at the messenger RMA 
level. Two small transcripts were found that represent the bulk of the vims specific RNA transcribed from 
the KSHV genome. One transcript was predicted to er>code a small membrane protein; the other is an 
unusual poly-A RNA that accumulates in the nucleus and may have no protein encoding sequence. 

20 Messenger RNA was analyzed by cloning a plurality of overlapping KSHV genomic fragments that 
spanned the -120 kb KSHV genome from a lambda library of genomic DNA. The dones were used as 
probes for Northern analysts, but their sequences were not obtained or disclosed. 

Moore et al. have partially characterized a KSHV genome fragment obtained from a body-cavity 
lymphoma. A 20.7 kb region of the genome was reportedly sequenced, although the sequence was not 

25 disclosed. 17 partial or complete open reading frames were present in this fragment, all except one having 
sequence and positional homology to other known gamma herpes virus genes, including the capsid 
maturation gene and the thymidine kinase gene. Phylogenettc analysis showed that KSHV was more 
closely related to equine herpes virus 2 and Saimiri virus than to Epstein Barr virus The 20.7 kb region did 
not contain sequences encoding either Glycoprotein B or DNA polymerase. 

30 The herpes virus family as a wt>ole comprises a number of multi-enveloped viruses about 100 nm in 

size, and capable of infecting vertebrates. (For general reviews, see. e.g.. Emery et al.. Fields et al.). The 
double-stranded DNA genome is unusually large - from about 88 to about 229 kilobases in length. It may 
produce over 50 different transcripts at various stages in the life cycle of the virus. A number of 
glycoproteins are expressed at the viral surface, and play a role in recognition of a target cell tjy the virus. 

35 and penetration of the virus into the cell. These surface proteins are relatively more variant t)etween 
species, compared with internal viral components (Kariin et al ). The same surface proteins are also 
present on defective viral particles produced by cells harboring the virus. One such non-infectious form is 
the L-partide. which comprises a tegument and a viral envelope, but lacks the nudeocapsid. 
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The herpes vims family has been divided into several subfamilies. Assignments to each of the 
categories were originally based on biologic properties, end are being refined as genomic sequence data 
emerges. The alpha subfamily comprises vinjses that have a broad host range, a short replicative cycJe. 
and an affinity for the sensory ganglia. They include the human simplex vinjs and the Varicella-zoster 
5 vinjs. The beta subfamily comprises vinjses that have a restricted host range. ar>d include 
Cytomegaloviais and human Herpes Vims 6. The gamma subfamily comprises viaises that are generally 
lymphotrophic. The DMA is mariced by a segment of about 1 10 kilobases with a low GC content, flanked 
by multiple tandem repeats of high GC content. The gamma subfamily includes Epstein Ban- Vims (EBV), 
herpes vims salmiri. equine Herpes Vims 2 and 5, and bovine Herpes Vims 4. 

10 Herpes vimses are associated with conditions that have a complex clinical course. A feature of 

many herpes vimses is the ability to go into a latent state within the host for an extended period of time. 
Vimses of the alpha subfamily maintain latent forms in the sensory and autonomic ganglia, whereas those 
of the gamma subfamily maintain latent forms, for example, in cells of the lymphocyte lineage. Latency is 
associated with the transcription of certain viral genes, and may persist for decades unUI conditions are 

15 optimal for the vims to resume active replication. Such conditions may include an immunodeficiency. In 
addition, some herpes vimses of the gamma subfamily have tiie ability to genetically transform the cells 
they infect. For example. EBV is associated with B cell lymphomas, oral hairy leukoplakia, lymphoid 
interstitial pneumonitis, and nasopharyngeal carcinoma. 

A number of other conditions occur in humans and other vertebrates that involve fitxoprolfferation 

20 and the generation of pre-r«oplastic calls. Examples occurring in humans are retroperitoneal fibrosis, 
nodular fibromatosis, pseudosarcomatous fibromatosis, and sclerosing mesenteritis. Another condition 
known as Enzootic Retiriperitoneal Fibromatosis (RF) has been obsen^ed in a colony of macaque 
monkeys at the University of Washington Regional Primate Research Center (Giddens et al.). Late stages 
of the disease are characterized by proliferating fibrous tissue around the mesentery arKl the dorsal part of 

25 the peritoneal cavity, with extension into the inguinal canal, through the diaphragm, and into the atxJominal 
wall. Or>ce clinically apparent, the disease is invariably fatal within 1-2 months. The condition has twen 
associated with simian immunodeficiency (SAIDS) due to a type D simian reti^jvims. SRV-2 (Tsai et a!.). 
However, other co(onies do not show the same frequerKry of RF amongst monkeys affected with SAIDS. 
arxJ the frequency of RF at Washington has been declining in recent years. 

30 The study of such conditions in norvhuman primates is important not only as a model for human 

corwlitions. txrt also because one primate species may act as a reservoir of vimses that affect another 
species. For example, the herpes vims saimiri appears to cause no disease in its natural host, the squirrel 
monkey (Saimiri sdureus). but it causes polyclonal T-cell lymphomas and acute leukemias in other 
primates, particularty owl monkeys. 

35 There is a need to develop reagents and methods for use in the detection and ti^atment of herpes 

vims infections. The etiological linkage between KSHV and Kaposi's sarcoma, confirmed by the 
serologtcal evidence, indkates the importance of this need. 

For example, there is a need to develop reagents and methods which can be used in the diagnosis 
and assessment of Kaposi's sarcoma, and similar conditions. Being able to detect the etiotogic agent in a 
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new patient may assist in diffenentiat diagnosis: being able to assess the tevel of the agent in an ongoing 
condition may assist in clinical management. Desirable martters include those that provide a very 
sensitive indication of the presence of both active and latent fomis viral infection, analogous to the HBsAg 
of Hepatitis B. Desirable markers also Include those that are immunogenic, and can be used to assess 
immunological exposure to the viral agent as manifest in the antibody response. Glycoprotein antigens 
from the viral envelope are particularly suitable as markers with these characteristics. They may be 
expressed at high abundance near the surface not only of replicative fomns of the virus, but also on L- 
partides produced by virally infected celts. 

Second, there is a need to develop reagents and methods that can be used for treatment of viral 
infection - both prophylactically. and following a viral challenge. Such reagents include vaccines that 
confer a level of immunity against the virus. Passive vaccines, such as those comprising an anti-virus 
antibody, may be used to provide immediate protection or prevent cell penetration and replication of the 
virus in a recently exposed individual. Active vaccines, such as those comprising an immunogenic viral 
component, may be used to elicit an active and ongoing immune response in an individual. Antibody 
elicited by an active vacdne may help protect an individual against a subsequent challenge by live virus. 
Cytotoxic T cells elicited by an active vaccine may help eradicate a concunent infection by eliminating host 
celts involved in viral replication. Suitable targets for a protective immune response, particularly antibody, 
are protein antigens exposed on the surface of viral partides. and those implicated in fusion of the vims 
with target cells. 

Third, there is a need to develop reagents and methods which can be used in the development of 
new phamnaceuticals for Kaposi's sarcoma, and similar conditions. The cun^nt treatment for Kaposi s is 
radiation in combination with traditional chemotherapy, such as vincristine (Northfett. Mitsuyasu). While 
lesions respond to these modalities, the response is temporary, and the downward clinical course 
generally resumes. Even experimental therapies, such as treatment with cytokines, are directed at the 
symptoms of the disease rather than the cause. Drug screening and rational daig design based upon the 
ebologic agent can be directed towards the long-felt need for a dinica) regimen with long-tenn efficacy. 
Suitable targets for such phamiaceuticals are viral components involved in recognition and penetration of 
host cells. These indude glycoprotein components of the viral envelope 

Fourth, there is a need to develop reagents and methods which can be used to identify new viral 
agents that may be assodated with other fibroprolrferative conditions. The representational difference 
analysis technique used by Chang et al. is arduously complex, and probably not appropriate as a general 
screening test. More desirable are a set of oligonudeotide probes, peptides, and antibodies to be used as 
reagents in more routine assays for sun^eying a variety of tissue samples suspeded of containing a related 
etiologic agent. The reagents should be suffidently spedfic to avoid identifying unrelated vinjses and 
endogenous components of the host, and may be sufficienUy cross-reactive to identify related but 
previously undescribed viral pathogens. 
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Disclosure of thp iMVgMTiQN 

It is an objective of this invention to provide isolated polynucleotides, polypeptides, and antibodies 
derived from or reactive with the products of novel genes encoding Glycoprotein B molecules of the 
5 RFHV/KSHV subfamily of herpes vinjses. Two membere of the family are Retroperitoneal Fibromatosis 
associated Herpes Virus (RFHV) and Kaposi's Sarcoma associated Herpes Virus (KSHV). These 
materials and related methods can be used in the diagnosis and treatment of herpes virus infection in 
primates, including humans. Isolated or recombinant Glycoprotein B fragments or polynucleotides 
encoding them may be used as components of an active herpes vaccine, while antibodies specific for 

1 0 Glycoprotein B may be used as components of a passive vaccine. 

Accordingly, one of the emtxxjiments of the invention is an isolated polynucleotide with a region 
encoding a Glycoprotein B of a herpes virus of the RFHV/KSHV subfamily, the potynudeotide comprisir^g 
a sequence of 319 nucleotides at least 65% identical to nucleotides 36 to 354 of SEQ. ID N0:1 or SEQ. ID 
N0:3. which are 319 nucleotide fragments encoding Glycoprotein B from RFHV and KSHV. respectively 

15 Also embodied is an isolated polynucleotide with a region encoding a Glycoprotein B. the polynucleotide 
comprising a sequence selected from the group consisting of: a sequence of 35 nucleotides at least 74% 
identical to oligonucleotide SHMDA (SEQ. ID NO:41); a sequence of 30 nucleotides at least 73% identical 
to oligonucleotide CFSSB (SEQ. ID NO:43): a sequence of 29 nucleotides at least 72% identical to 
oligonucleotide ENTFA (SEQ. ID NO:45); and a sequence of 35 nucleotides at least 80% identical to 

20 oligonucleotide DNIQB (SEQ. ID N0:46). 

Another embodiment of the invention is an isolated polynucleotide comprising a fragment of at least 
21. preferably 35. more preferably 50, stilt more preferably 75. and even more preferably 100 consecutive 
nucleotides of the Glycoprotein B encoding region of the potynudeotide of the preceding emtx>diments. 
The potynudeotide is preferably from a virus capable of infecting primates, fnduded are Glycoprotein B 

25 encoding polynucleotide fragments from RFHV and KSHV. Another emttodrment of the invention is an 
isolated potynudeotide comprising a linear sequence of at least about 21 nudeotides identical to a the 
Glycoprotein B encoding sequence t}etween nucleotides 36 to 354 indusive of SEQ. ID NQ:1. SEQ. ID 
NO:3, or SEQ. ID NO:92. or anywhere within SEQ. ID NO;96. but not in SEQ. ID NO:98 

A further emtjodrrrient of this invention is an isolated polypeptide encoded by any of the previous 

30 embodiments. Also embodied is an isolated polypeptide, comprising a linear sequence of at least 17 
amino adds essentially identical to the Glycoprotein B protein sequence shown in SEQ. ID N0:2, SEQ. ID 
NO:4, or SEQ. ID NO;97. or anywhere within SEQ ID NO:94 (KSHV), but not in SEQ. ID NO:99. This 
indudes fusion polypeptides, immunogenic poiypeptides, and polypeptides occurring in glycosylated and 
unglycosylated form. Some preferred antigen peptides are listed in SEQ ID NOS:67-76. Also embodied 

35 are isolated and norvnaturally occurring polynucleotides encoding any of the aforementioned polypeptides, 
along with doning vectors, expression vectors and transfected host cells derived therefrom Further 
embodiments are mettrod for produdng polynucleotides or polypeptides of this invention, comprising 
replicating vectors of the invention or expressing pdynudeotides in suitable host c»lls 
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Yet another embodiment of this invention is a monoclonal or isolated polyclonal antibody specific for 
a Glycoprotein B polypeptide embodied in tNs invention, or a Glycoprotein B encoded in tne encoding 
region of a polynucleotide embodied in this invention. The antibodies are specific for memtjers of the 
RFHV/KSHV subfamily, and do not cross-react with more distantly related Glycoprotein B sequences, 
5 particuiany SEQ. ID NOS:3(M1 . 

other Glycoprotein Bantitxxiies are specificA morHxdonal or isolated polyclonal antibody specific for the 
polypeptide of claim 9. but not for a polypeptide having an amino add sequence of any of SEQ. ID 
NOS:30-41. 

Still another embodiment of this invention is a vaccine comprising a polypeptide of this invention in 

10 a pharmaceutically compatible exdpient, and optionally also comprising an adjuvant. Another embodiment 
of this invention is a vaccine comprising a polynucleotide of this invention, which may be in the form of a 
live virus or viral expression vector Another emtjodiment of this invention is a vaccine comprising an 
antitxxjy of this invention in a pharmaceutically compatible excipient. Other emtxxliments are methods for 
treating a herpes virus infection, either prophytacticalty or during an ongoing infection, comprising 

1 5 administering one of the aforementioned embodiments. 

Further embodiments of this invention are oligonucleotides specific for Glycoprotein B encoding 
sequences of the gamma herpes subfamily, the RFHV/KSHV subfamily, RFHV, and KSHV. especially 
those listed in SEQ. ID NOS:24-63. Also embodied are methods for obtaining an amplified copy of a 
polynucleotide encoding a Glycoprotein B, comprising contacting the polynucleotide with one or more of 

20 the aforemerrtioned oiigonudeotides. The polynucleotide to be amplified may be taken from an individual 
affected with a disease featuring fibroblast proliferation and collagen deposition, including but not limited to 
Retroperitoneal Fibromatosis or Kaposi's Sarcoma, or a malignancy of the lymphocyte lineage. 

Additiorial emtxxiiments of this invention are methods for detecting viral DNA or RNA in a sample. 
Or>e method comprises the steps of contacting the DNA or RNA in the sample with a prot>e comprising a 

25 polynucleotide or oligonucleotide of this invention under conditions that vi/outd permit the probe to iorm a 
stat>le duplex witii a polynucleotide having the sequence shown in SEQ. ID NO:1 or SEQ. ID NO:3. or 
t>oth, but not with a polynucleotide having a sequence of herpes viruses outside the RFHV/KSHV 
subfamily, particularty SEQ. ID NOS:5-13. and detecting the presence of any duplex fomned thereby. The 
conditions referred to are a single set of reaction parameters, such as incubation time, temperature, solute 

30 cor>centi3tions, and washing steps, that would permit the polynucleotide to form a stable duplex if 
alternatively contacted with a polynucleotide with SEQ. ID NO:1, or with a polynucleotide with SEQ. ID 
NO:3, or with both, but not vtrith a polynucleotide of any of SEQ ID NO:5-l3. Anoti^er method comprises 
the steps of amplifying the DNA or RNA in ttie sample using an oligonucleotide of this invention as a 
primer in the amplification reaction, and detecting the presence of any amplified copies. Also emtx}died 

35 are isolated polynucleotides identified by the aforementioned methods, as may t>e present in the genome 
of a naturally occurring virus or affected tissue . 

Further embodiments of this invention are diagnostic kits for detecting components related to 
herpes virus infection in a biological sample, such as may be obtained from an individual suspected of 
harboring such an infection, comprising a polynucleotide, oligonucleotide, polypeptide, or antitxxJy of this 
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invention in suitable packaging. Also emtxxjied are methods of detecting infection of an individual, 
compfising applying ttie reagents, methods, or kits of this invention on biological samples obtained from 
the individual. 

Still other embodiments of this invention are therapeutic compounds and compositions for use in 
5 treatment of an individual for infection by a gamma herpes vinjs. Included are therapeutic agents that 
comprise polynucleotides and vectors of this invention for the purpose of gene ttmrapy. Also included are 
phamiaceuticat compounds identified by contacting a polypeptide emtiodied in this invention with the 
compound and detemiining whether a biochemical function of the polypeptide is altered. Also included are 
pharmaceutical compounds obtained from rational drug design, based on structural and biochemical 
1 0 features of a Glycoprotein 8 molecule. 

BmgP DESCRIPTION OF THE DRAWINGS 

Figure 1 is a listing of polynucleotide sequences amplified from a Glycoprotein B encoding region of 

15 RFHV and KSHV. The 319-base polynucleotide segment between residues 36 to 354 is underlined, and 
represents the respective viral gene segment between the primers used to amplify it. Aligned with the 
polynucleotide sequences are oligonucleotides that may be used as hybridization probes or PGR primers. 
Type 1 oligonucleotides comprise a gamma herpes consensus sequence, and can be used to amplify a 
Glycoprotein B gene segment of a gamma herpes virus. Examples shown are NIVPA and TVNCB. Type 

20 2 oligonucleotides comprise a consensus sequence from the RFHV/KShlV subfamily, and can be used to 
amplify Glycoprotein B gene segment of a vims belonging to the subfamily. Examples shown are SHMDA, 
CPS SB, ENTFA and DNIQB. The other oligonucleotides shown are Type 3 oligonucleotides. These 
comprise sequences taken directly from the RFHV or KSHV sequence, and are specific for 5equerK:es 
from the respective vinjs. Oligonucleotides that initiate amplification in the direction of the coding 

25 sequence (with designations ending in 'A') are listed 5'-^3'. Oligonucleotides that initiate amplification in 
the direction opposite to that of the coding sequence (with designations endir^ in "B") are listed 3'-*5*. 
Also shown are the polypeptides encoded by the RFHV and KSHV polynucleotide sequences. The 
asparagine encoded by nucleotides 23&-240 in both sequences is a potential N-)inked glycosylation site 
conserved with other herpes viruses. 

30 Figure 2 is a map of the Glycoprotein B erx^ding DMA sequence believed to be contained in the 

KSHV genome, and other members of the RFHV/KSHV subfamily. Shown are the approximate location of 
the KSHV Glycoprotein B sequence described herein. Also shown are the putative conserved segments 
that represent hybridization sites for Type 1 consensus/degenerate oligonucleotides useful in probing and 
amplifying Glycoprotein B sequences from gamma herpes viruses 

35 Figure 3 is a listing of some previously known herpes virus Glycoprotein B protein sequences, 

aligned with the complete KSHV Glycoprotein B protein sequerwe and fragments of RFHV1 and RFHV2. 
Boxed regions indicate tf>e putative pre-processing signal sequertce and the transmembrane domain. 
Cysteine residues are underiined. Residues that are highly consen/ed amongst herpes virus Glycoprotein 
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B sequences are underscored with an asterisk (•). Cysteines appearing uniquely in the KSHV 
Glycoprotein B are underscored with a bullet (•) 

Figure 4 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viaises. showing a conserved region, and the Type 1 oligonucleotide FRFDA designed therefrom. 
5 Figure 5 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 

viruses, showing a conserved region, and the Type 1 oligonucleotides NIVPA and NIVPASQ designed 
therefrom. 

Figure 6 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a consen/ed region, and the Type 1 oligonucleotides TVNCA. TVNCB and TVNCBSQ 
1 0 designed therefrom. 

Figure 7 is a listing of previously krwwn Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonudeotide FAYDA designed therefrom. 

Figure 8 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotides lYGKA and lYGKASQ designed 
15 therefrom. 

Figure 9 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotides CYSRA and CYSRASQ designed 
therefrom. 

Figure 10 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma 
20 herpes viaises. showing a conserved region, and the Type 1 oligonucleotides NIDFB and NIDFBSQ 
designed therefrom. 

Figure 11 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma 
herpes vinjses. showing a conserved region, and the Type 1 oligonucleotides FREYA. FREYB and 
NVFDA desigr>ed therefrom. 

25 Figure 12 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma 

herpes viruses, showing a conserved region, and the Type 1 oligonucleotide GGMA desigr>ed therefrom. 

Figure 1 3 is a listing of a porton of the Glycoprotein B polynucleotide sequence from RFHV and 
KSHV. aligned with previously known gamma herpes Glycoprotein B polynucleotide sequences. Each 
shared residue is indicated as a period. 

30 Figure 14 is a comparison listing of the polypeptide sequences of Glycoprotein B from various 

gamma herpes vimses, encoded t>etween the hybridization sites of NIVPA and TVNCB in the 
polynucleotide sequences. The Class II sequence fragments shown undertined are predicted to be 
RFHV/KSHV cross-reactive antigen peptides. The Class III sequences shown in lower case are predicted 
to be RFHV or KSHV viais-specific peptides. 

35 Figure 1 5 is an alignment of the pwlypeptide sequences of Glycoprotein B over a broader spectrum 

of herpes viruses in the gamma. t>eta, and alpha subfamilies. 

Figure 16 is a relationship map of Glycoprotein B, based on the polypeptide sequences shown in 
Figure 15. 

? 
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Figure 17 is a listing of exemplary Type 2 (subfamlly-spedfic) oligonudeobdes. aligned with the 
nucleotide sequences from which they were derived. 

Figure 18 is an approximate map of Glycoprotein B and DNA polymerase erx»ding regions as they 
appear in the KSHV genome, showing the hybridization position of oligonucleotide primers. 
5 Figure 19 is a listing of a KSHV DNA sequence obtained by amplifying fragments upstream and 

downstream from the sequence in Figure 1 . An open reading frame is shown for the complete KSHV 
Glycoprotein B sequence, flanked by open reading frames for the capsid maturation gene and DNA 
potymerase. Underlined in the nudeotide sequence is a putative Glycoprotein B promoter 

Figure 20 is a Hopp-Woods antigenicity plot for the 106 nudeotide Glycoprotein 6 potypeptide 
1 0 fragment of RFHV encoded between NfVPA and TVNCB. Indicated below are spans of hydrophobic and 
antigenic residues in the sequence. 

Figure 21 is a Hopp-Woods antigenicity plot for the 106 nudeotide Glycoprotein B polypeptide 
fragment of KSHV encoded between NIVPA and TVNCB. 

Figure 22 is a Hopp-Woods antigenicity plot for the comptete Glycoprotein B from KSHV. 
15 Figure 23 is a listing of DNA and protein sequences for a Gtycoprotein B fragment of a third 

member of the RFHV/KSHV subfamily, designated RFHV2. The 319-base polynudeotide segment 
t>etween residues 36 to 354 is underlrned. and represents the Glycoprotein B encoding segment between 
the primers used to amplify it. 

20 Best Mope for Carrying Out the Invention 

We have discovered and charaderized poiynudeotides encoding Glycoprotein B from herpes 
viruses of the RFHV/KSHV subfamily. The poiynudeotides, oligonudeotides. polypeptides and antibodies 
embodied in this invention are useful in the diagnosis, dinical monitoring, and treatment of herpes virus 

25 infections ar>d related conditions. 

The source for the polynudeotide for the RFHV Gtycoprotein B was affected tissue samples taken 
from Macaque nemesthna monkeys with retroperitoneal fibnamatosis fRF"). The polynudeotide for the 
KSHV Gtycoprotein B was obtained from affected tissue samples taken from humans with Kaposi's 
Sarcoma C*KS'). The tissues used for the present invention were known to contain genetic material from 

30 RFHV or KSHV. because they had previously been used successfully to dof>e corresponding DNA 
Polymerase encoding fragments. The amplification of the DNA Polymerase regions have t>een descrit>ed 
in commonly owned U.S. Patent Application 60/001 .146. 

In order to amplify the Glycoprotein B sequences from these samples, we designed 
oligonucleotides from those of other herpes viruses. Glycoprotein B is expected to be less welt conserved 

35 between herpes vinises. because it is externally exposed on the viral envek)pe and therefore under 
selective pressure from the immune system of the hosts they infed. Accordingly, the oligonudeotides 
were designed from sequences of herpes viruses believed to be most dosety related to RFHV and KSHV. 
These two viruses are known from the DNA polymerase sequences to t>e closely related gamma type 
herpes vinjses. 
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Oligonucleotides were designed primarity from Glycoprotein 6 sequences previously known for four 
gamma herpes viruses: sHV1. eHV2. bHV4, mHV68 and fiEBV. Comparison of the amino add 
sequences of these four Glycoprotein B molecules revealed nine relatively conserved regions. Based on 
the sequence data, oligonucleotides were constructed comprising a degenerate segment and a consensus 
5 segment, as descrit>ed in a following section. Three of these oligonucleotides have been used as primers 
in amplification reactions that have yielded fragments of the RFHV and KSHV Glycoprotein B encoding 
segments from the RF and KS tissue. 

The RFHV and KSHV polynucleotide sequence fragments obtained after the final amplification step 
are shown in Figure 1 (SEQ. ID NO:1 and SEQ. ID NO:3, respectively). Included are segments at each 

1 0 end corresponding to the hybridizing regions of the NIVPA and TVNCB primers used in the amplification. 
The fragment between the primer binding segments is 319 base pairs in length (residues 36-354), and 
believed to be an accurate reflection of the sequences of the respective Glycop)rotein B encoding regions 
of the RFHV and KSHV genomes. 

The 319 base pair Glycoprotein B encoding polynucleotide segment from RFHV is only 60% 

1 5 identical with that from sHV1 and bHV4, the most closely related sequences ftT>m outside the RFHV/KSHV 
subfamily. The 319 base pair polynucleotide segment from KSHV is only 63% identical with sHVl and 
bHV4. The segments are 76% identical between RFHV and KSHV. 

Also shown are the con^sponding predicted amino add sequences (SEQ. ID NO:2 and SEQ. ID 
NO:4). The polypeptide sequervces are novel, and are partiy homologous to Glycoprotein B sequences 

20 from other herpes viruses. The fragments shown are predicted to be atxxit 1/6 of the entire Glycoprotein B 
sequence. They t>egin about 80 amino adds downstream from the predicted N-temiinal methionine of the 
pre-processed protein There is a potential N-linked glycosylation site at position 80 of tr>e amino add 
sequence, according to the sequence Asn-Xaa-(Thr/Ser). This site is conserved t>etween RFHV and 
KSHV, and is also conserved amongst other known gamma herpes viruses. There is also a cysteine 

25 residue at position 58 ttiat is conserved across herpes viruses of the gamma, beta, and alpha subfamilies, 
which may play a role in maintaining the three-dimensional structure of the protein. 

The 106 amino add segment of Glycoprotein B encoded by the 319 base pairs between the 
amplification primers is 91% identical between RFMV and KSHV. but only 65% identical between KSHV 
and that of bHV4, the dosest sequence outside the RFHV/KSHV subfamily. 

30 Glycoprotein B molecules expressed by the RFHV/KSHV herpes virus subfamily are expected to 

have many of tiie properties described for Glycoprotein B of other herpes vimses. Glycoprotein B 
molecules are generally atx)ut 110 kDa in size, corresponding to at)out 600-900 amino adds or about 
2400-2700 base pairs. Hydrophobidty plots indicate regions from the N terminus to the C terminus in the 
following order a hydrophobic region con-esponding to a membrar>e-directing leader sequence; a mixed 

35 polarity region corresponding to an extracellular domain: a hydrophobic region corresponding to a 
transmembrane domain; and arwther mixed polarity region con-esponding to a cytoplasmic domain. 

The full sequence of the KSHV Glycoprotein B, shown in Figure 19. confirms these predictions: 
The gene encodes atooUt 645 amino adds including the signal peptide and a transmemtjrane region near 
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the C-termtnus. Cysteine residues are conserved with other Glycoprotein B sequences, and an additional 
potential disutfide may help stabilize the three-dimensional structure. 

Glycoprotein B is generally expressed on the envelope of infectious and defective viral particles, 
and on the surface of infected cells. It Is generally glycosylated, and may comprise 5-20 glycosylation 
5 sites or more. It is also generally expressed as a protein dimer, which assembles during translocation to 
the surface of the host cell, prior to budding of the virus. The site responsible for dimerization appears to 
be located between atx)iJt amirw add 475 and the membrane spanning segment (Navarro et al.). 

Previous studies have mapped several biochemical functions related to infectivity to different 
regions of the Glycoprotein B molecule. Glycoprotein B and Glycoprotein C are both implicated in initial 
10 binding of HSV1 and bovine herpes vims 1 to target cells (Herold et al., Byrne et al ). The moiety on the 
cells recognized by Glycoprotein B appears to be heparan sulfate; the binding is inhibrtable by fluid-phase 
heparin. Mutants that lack Glycoprotein C can still bind target cells, but mutants that lack both 
Gtycoprotein C and Glycoprotein B are severely impaired in their ability to gain access to the cells. 

Another apparently important function is the ability of Glycoprotein B to promote membrar>e fusion 
15 and entry of the vims into the cell. In human CMV, the fusogenic roie appears to map to the first 
hydrophobic domain of Glycoprotein B. and may be associated with conserved glycine residues within this 
region (Reschke et al.). In HSV1 mutants, the ability of Glycoprotein B to promote syncytia formation 
maps to multiple sites in the cytoplasmic domain of the protein, near the C-terminus (Kostal et al.). 

In order to exercise some of these more complicated functions, it seems likely that Glycoprotein B 
20 associates not only with a second Glycoprotein B molecule, but with other components encoded by the 
virus. For example, the UL45 gene product appears to be required for Glycoprotein B induced fusion 
(Haanes et al ). It has been hypothesized that Glycoprotein B cooperates with other surface proteins to 
form a hydrophobic fusion pore in the surface of the target cell (Pereira et al.). Glycoprotein B has been 
found to elicit a potent antibody response capable of neutralizing the intact virus. Monoclonal antibodies 
25 with neutralizing activity may be directed against many different sites on the Glycoprotein B molecule. 

Consequently, it is expected that the Glycoprotein B molecule bears sites that interact with the 
target celt, help promote fusion, and associate with other viral proteins It is predicted that Glycoprotein B 
molecules of RFHV/KSHV subfamily viarses will perform many of the functions of Glycopnstein B in other 
species of herpes virus, and bear active regions with some of the same properties Interfering with any of 
30 these active regions with a dnjg, an antibody, or by mutation, may impair viral infectivity or virulence 

Subsequent to discovery of the Glycoprotein B of RFHV and KSHV. a third member of the 
RFHV/KSHV subfamily was identified in a sample of affected tissue from a Macaca muiatta (Example 12). 
This Glycoprotein B is closely related but not identical to RFHV. and is designated RFHV2. It is predicted 
that other memt)ers of the RFHV/KSHV subfamily will emerge, including some that are pathogenic to 
35 humans. This disclosure teaches how new members of the subfamily can be detected and characterized. 

The homotogy between Glycoprotein B sequences within the RFHV/KSHV subfamily means that 
the polynucleotides and polypeptides embodied in this invention are reliable markers amongst different 
strains of the subfamily. The polynucleotides, polypeptides, and antibodies embodied in this invention are 
useful in such applications as the detection and treatment of viral infection in an individual, due to RFHV. 

li 
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KSHV. or other herpes viruses in the same subfamily. The polynucleotides, oligonucleotide probes, 
potypeptides. antibodies, and vaccine compositions relating to Glycoprotein B, and the preparation and 
use of these compounds, is descntwd in further detail in the sections that foltow. 

5 Abbrmvfations 



The following at>breviattons are used herein to refer to species of herpes vinjses, and 
polynucleotides and polypeptides derived thefefrom; 



TABLE 1 : Abbreyiations for Herpes Virus Strains ] 


Designation 


Virus 


Provtstonat Subfamily 
Assignment | 


RFHV 


simian Retroperrtoneal Fibromatosis- 
assodated Herpesvirus 


gamma-HerpesVirus 


KSHV 


human Kaposi's Sarcoma-assodated 
Herpesvirus 






mitPtnA tJdhf f /in i* Aft 
filUintO 1 Itff^Jco viruo OO 




bKV4 


bovine Herpesvirus 4 




eHV2 


equine Herpesvirus 2 




sHVI 


saimiri monkey Herpesvirus 1 




hEBV 


human Epstein-Barr Vims 




hCMV 


human CytoMegaloVirus 


beta-HerpesVirus 


mCMV 


murine CytoMegaloVirus 




gpCMV 


guinea pig CytoMegaloVirus 




hHV6 


human Herpesvirus 6 




hVZV 


human Varicella-Zoster Virus 


alpha-HerpesVinjs 1 


HSVl 


human Herpes Simplex Virus 1 




HSV2 


human Herpes Simplex Virus 2 




SHVSA8 


simian Herpesvirus AS 




eHVI 


equine Herpesvirus 1 




iHV1 


tctalurid catfish Herpesvirus 





10 

Genera! Definitions 

"Glycoprotein B" is a particular protein component of a herpes virus, encoded in the viral genome 
and believed to be expressed at the surface of the intact viats. Functional studies with certain species of 
15 herpes viats, especially HSVl, hCMV. and bovine herpes virus 1, have implicated Glycoprotein B in a 
numt)er of biochemical functions related to viral infecthrity. These include binding to components on the 
surface of target cells, such as heparan sulfate, fusion of the viral membrane with the membrane of the 

/A 
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target cell, penetration of the virat capsid into the cell, and formatton of potynudeated syncytial cells 
Glycoprotein B has been olwerved as a homodimer, and may interact with other viral surface proteins in 
order to exert some of its biochemical functions. Different biochemical functions, particularty heparan 
sutfate binding and membrane fusion, appear to map to different parts of the Glycoprotein B rrxDlecule. A 
5 Glycoprotein B molecule of other herpes viruses, including members of the RFHV/KSHV subfamily, may 
perfbrm any or all of these functions. As used herein, the term Glycoprotein B includes unglycosylated. 
partly glycosylated, and fully glycosylated forms, and both monomers and polymers. 

As used herein, a Glycoprotein B fragment, region, or segment is a fragment of the Glycoprotein B 
molecule, or a transcript of a sutxegion of a Glycoprotein B encoding polynucleotide. The intact 
10 Glycoprotein B molecule, or the fulMength transcript, will exert biochemical functions related to viral 
activity, such as those described above. Some or all of these functions may be preserved on the fragment, 
or the fragment may be from a part of the intact molecule which is unable to perform these functions on its 
own. 

'Glycoprotein B activity' refers to any biocfwnical function of Glycoprotein B, or any biological 
1 5 activity of a h»erp>es viois attributable to Glycoprotein B. These may include but are not limited to binding of 

the protein to cells, cell receptors such as heparan sulfate, and receptor analogs; viral binding or 

penetration into a cell, or cell fusion. 

The term "Glycoprotein B gene" refers to a gene comprising a sequence that encodes a 

Glycoprotein B molecule as defined above. It is understood that a Glycoprotein 8 gene may give rise to 
20 processed and altered translation products, including but not limited to forms of Glycoprotein B with or 

witftout a signal or leader sequervce, truncated or internally deleted forms, muttimeric fomns. and forms with 

different degrees of gtycosylation. 

As used herein, a 'DNA Polymerase' is a protein or a protein analog, that under appropriate 

conditions is capable of catalyzing the assemt}ly of a DNA polynucleotide with a sequence that is 
25 complementary to a polynucleotide used as a template. A DNA Polymerase may also have other catalytic 

activities, such as 3*-5' exonuclease activity; any of the activities may predominate. A DNA Polymerase 

may require association with additional proteins or co-factors in order to exercise its catalytic function. 

■RFHV is a vinjs of the herpes family detected in the tissue samples of Macaque nemestrina 

monkeys affected with Retroperitor>eal Fibromatosis (RF>. RFHV is synonymous with the terms 'RFHV1 ', 
30 "RFHVMn". and *RFMn'. "KSHV is a virus of the herpes virus family detected in the tissue samples of 

humans affected with Kaposi's Sarcoma (KS). A third member of the RFHV/KSHV subfamily is a vinjs 

identified in a M. mutatta monkey. The virus is referred to herein as *RFHV2'. 'RFHV2' is synonymous 

with the terms 'RFHVMm* and 'RFMm'. 

The 'RFHV/KSHV subfamily" is a term used tierein to refer to a coltectjon of herpes viruses capable 
35 of infecting vertebrate species. The subfamily consists of memt>ers that have Glycoprotein B sequences 

that are more closely related to that of the corresponding sequences of RFHV or KSHV than other herpes 

vinises, including sHV1, eHV2, bHV4, mHV58 and hEBV. Preferatjty, the polynudeotde encoding 

Glycoprotein B comprises a segment that is at least 65% identical to that of RFHV (SEQ. ID N0:1) or 

KSHV (SEQ. ID NO: 3) between residues 36 and 354; or at least about 74% identical to the oligonucleotide 
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SHMDA. or at least about 73% identical to the oUgonudeotide CFSSB, or at least about 72% identical to 
the nucleotide EKTFA. or at least about 60% identcal to the nucleotide DNIQB, RFHV and KSHV are 
exemplary members of the RFHV/KSHV subfamily. The RFHV/KSHV subfamily represents a subset of 
the gamma subtemily of herpes viruses. 
5 The temis "polynucleotide" and "oligonucleotide' are used interchangeably, and refer to a polymeric 

form of nucleotides of any length, either deoxyribonudeotides or ribonucleotides, or analogs thereof. 
Polynudeotides may have any three-dimensional structure, and may perfomn any function, known or 
unknown. The following are non-limiting examples of polynudeotides: a gene or gene fragment, exons. 
introns. messenger RNA (mRh4A). transfer RIMA, ribosomal RNA, ribozymes, cDNA. recombinant 

10 polynudeotides, branched polynudeotides, plasmids, vectors, isolated ONA of any sequence, isolated 
RNA of any sequence, nucleic add probes, and primers. A polynudeotide may comprise modified 
nudeotides, such as methylated nudeotides and nudeotide analogs. If present, modifications to the 
nudeotide structure may be imparted before or after assembly of the polymer. The sequence of 
nudeotides may be intenupted by non-nudeobde components. A polynudeotide may be further modified 

1 5 after polymerization, such as by conjugation with a labeling component. 

The term polynucleotide, as used herein, refers to both double- and stngle-stranded molecules. 
Unless othenwise spedfied or required, any embodiment of the invention described herein that is a 
polynudeotide encompasses both the double-stranded form and each of two complementary 
sir>gle-stranded forms known or predicted to make up the dout^le-stranded form. 

20 In the context of polynudeotides, a 'linear sequence" or a 'sequence' is an order of nudeotides in 

a polynucleotide in a 5' to 3' direction in which residues that neighbor each other in the sequence are 
contiguous in the primary structure of the polynudeotide. A "partial sequence' is a linear sequence of part 
of a polynudeotide which is known to comprise additk>nal residues in one or both directions. 

"Hybridization" refers to a reaction in which one or more polynudeotides react to form a complex 

25 that is stabilized via hydrogen bonding between the bases of the nudeotide residues. The hydrogen 
txjnding may occur by Watson-Crick base pairing, Hoogsteen binding, or in any other sequence-spedfic 
manner. The complex may comprise two strands fomning a duplex structure, three or more strands 
forming a mutti-stranded complex, a single self-hybridizing strand, or any combination of these. A 
hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PCR, 

30 or tne enzymatic deavage of a polynudeotide by a ribozyme. 

Hyt>ridization reactions can be performed under conditions of different 'stringency'. Conditions that 
increase the stringency of a hybridization reaction are widely known and put>fished in the art: see. for 
example, Sambrook Frttsch & Maniatis. Examples of relevant conditions indude (in order of increasing 
stringency): incubation temperatures of 25'C, 37"C, 50"C. and 68*C; buffer concentrations of 10 x SSC. 6 

35 X SSC, 1 X SSC. 0.1 X SSC (where SSC is 0.15 M NaCl ar>d 15 mM dtrate buffer) and their equivalent 
using other buffer systems: formamide concerrtratiorw of 0%, 25%, 50%. and 75%; incubation times from 5 
min to 24 h: 1, 2. or more washing steps: wash incubation times of 1. 5. or 15 min; and vrash solutions of 6 
X SSC, 1 X SSC. 0.1 X SSC. or deionized water. 
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"T„' is the temperature in degrees CentigTade at which 50% of a polynucleotide duplex made of 
comptementary strands hydrogen txsnded in an antiparaJlel direction by Watson-Crick base paring 
dissociates Into single strands under the conditions of the experiment. may be predicted according to 
standard formula: for example: 

5 

T„ = 81.5 + 16.6 log [Na*] + 0.41 (%G/C) - 0.61 (%F) - 600/L 

where [Na*l is the cation concentration (usually sodium ion) in mol/L; (%G/C) is the number of G and C 
residues as a percentage of total residues in the duplex: (%F) is the percent fonnamide in solution (wtA^ol); 

1 0 and L is the numt>er of nucleotides in each strand of the duplex. 

A 'stable duplex" of polynucleotides, or a 'stable complex* fornied between any two or more 
components in a biochemical reaction, refers to a duplex or complex that is sufficiently tong-iasting to 
persist between the formation of the duplex or complex, and its subsequent detection. The duplex or 
complex must be able to withstand whatever conditions exist or are introduced between the moment of 

1 5 formation and the moment of detection, these conditions being a function of the assay or reaction which is 
being performed. Intervenir>g conditions which may optionally t>o present and which may dislodge a 
duplex or complex include washing, heatir^g, adding additior^al solutes or solvents to the reaction mixture 
(such as denaturants). and competing with addttior^l reacting species Stable duplexes or complexes 
may be irreversible or reversible, but must meet the other requirements of this definition. Thus, a transient 

20 complex may form in a reaction mixture, but rt does not constitute a stable complex tf it dissociates 
spontaneously or as a result of a newly imposed condition or manipulation introduced before detection. 

When stable duplexes form in an antiparallel configuration between two single-stranded 
polynucleotides, particulariy under conditions of high stringency, tiw strands are essentially 
"complementary". A doutsle-stranded polynucleotide can be "compte-mentary" to another polynucleotide, if 

25 a stable duplex can form between one of the strands of the first polynucleotide and the second. A 
complementary sequence predicted from the sequence of a single stranded polynucleotide is the optimum 
sequence of standard nucleotides expected to form hydrogen bonding with the single-stranded 
polynucleotide according to generally accepted ba&e-pairir>g rules 

A 'sense' strand and an 'antisense' stiand when used in the same context refer to single-stranded 

30 polynucleotides which are complementary to each other They may be opposing strands of a 
double-stranded polynucleotide, or one strand may be predicted from the other according to generally 
accepted base-pairir>g rules, tf not specified, the assignment of one or the other strand as 'sense' or 
'antisense' may be artxtrary. In relation to a potypeptide^ncoding segment of a polynucleotide, the 
'sense* strand is generally the strand comprising the encoding segment 

35 When comparison is made between polynucleotides for degree of identity, it is implicitly understood 

that complementary strands are easily generated, and the sense or antisense strand is selected or 
predicted that maximizes the degree of identity between the polynucleotides being compared. For 
example, where one or both of the poly nucleotides being compared is douWe-strarvSed. the sequences are 
identical if one strand of the first polynucleotide is identical with one strand of the second polynucleotide. 
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Simtlariy, when a polynucleotide probe is described as identical to its target, it is understood that it is the 
complementary strand of the target that participates in the hybridization reaction between the prot>e and 
the target 

A (inear sequence of nucleotides is 'essentiatty identical' to another linear sequence, if both 
5 sequences are capable of hybridizing to form duplexes with the same complementary polynucleotide. 
Sequences that hybridize under conditions of greater 8tringer>cy are more preferred. It is understood that 
hybridization reactions can accommodate insertions, deletions, and substitutions in the nucleotide 
sequefK^e. Thus, lirtear sequences of nucleotides can be essentially identical even if some of the 
nucleotide residues do not precisely con*espond or align. Sequences that correspond or align more dosely 

10 to the invention disclosed herein are comparably more preferred. Generally, a potynucleotide region of 
about 25 residues is essentially identical to another region, rf the sequences are at least about 85% 
identical; more preferably, they are at least about 90% identical; more preferably, they are at least about 
95% identical; still more preferably, the sequences are 100% identical. A polynucleotide region of 40 
residues or more will be essentially identical to another region, after alignment of homologous portions if 

15 the sequences are at least about 75% identical; more preferably, they are at least at>out 80% identical: 
more preferably, they are at least about 85% identical; even more preferably, they are at least about 90% 
identical; still more preferably, the sequences are 100% identical. 

In determining whether polynucleotide sequences are essentially identical, a sequence that 
preserves the functionality of the polynucleotide with which it is being compared is particularty preferred. 

20 Functionality can be determined by different parameters. For example, if the polynucleotide is to be used 
in reactions that involve hybridizing with ar>other polynucleotide, then preferred sequences are those which 
hybridize to the same target under simitar conditions. In general, the T„ of a DNA duplex decreases by 
at>out 1*C for every 1% decrease in sequence identity for duplexes of 200 or more residues; or by atxsut 
5*C for duplexes of less than 40 residues, depending on the position of the mismatched residues (see, 

25 e.g., Meinkoth et al.}. Essentially identical sequences of at»ut 100 residues will generally form a stable 
duplex with each other's respective complementary sequence at about 20*C less than T„,; preferably, they 
will form a stable duplex at about 15*C less; more preferably, they will form a stable duplex at about 10*C 
less; even more preferably, they will form a stable duplex at about 5*0 less; still more preferably, they will 
form a stable duplex at about T^. In another example, if the polypeptide encoded by the polynucleotide is 

30 an important part of its functionality, then prefened sequences are those which encode identical or 
essentially identical polypeptides. Thus, nucleotide differences which cause a conservative amino acid 
substitution are preferred over those which cause a non-conservative substitution, nucleotide differences 
which do not alter the amirw acid sequence are more preferred, while identical nucleotides are even more 
preferred. Insertions or deletions in the polynucleotide that result in insertions or deletions in the 

35 polypeptide are preferred over those that result in the down-stream coding region t)eing rendered out of 
phase; polynucleotide sequences comprising no insertions or deletions are even more piefened. The 
relative importance of hybridization properties and the encoded polypeptide sequence of a polynucleotide 
depends on the application of the invention. 
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A potynudeotide has the same 'characteristics' of another potynudeotide rf both are capable of 
forming a stable duplex with a particular third pol/nudeotide under similar conditions of maximal 
sthngency. Preferably, in addition to simitar hybridization properties, the polynucleotides also encode 
essentially identical polypeptides. 
5 "Conserved' residues of a potynudeotide sequence are those residues which occur unaltered in the 

same position of two or more related sequences being compared. Residues that are relatively conserved 
are those that are conserved amongst more related sequences or with a greater degree of identity than 
residues appearing elsewt>ere in the sequences. 

"Related' polynudeotides are potynudeotides that share a significant proportion of identical 
10 residues. 

As used herein, a 'degenerate' oligonudeotide sequence is a designed sequence derived from at 
least two related originating polynudeotide sequences as follows: the residues that are conserved in the 
originating sequences are preserved in the degenerate sequence, while residues that are not conserved in 
the originating sequences may t>e provided as several alternatives in the degenerate sequence. For 

15 example, the degenerate sequence AYASA may be designed from originating sequences ATACA and 
ACAGA, where Y is C or T and S is C or G. Y and S are examples of 'ambiguous' residues. A 
degenerate segment is a segment of a potynudeotide containing a degenerate sequence. 

It is understood that a synthetic oligonudeotide comprising a degenerate sequence is actually a 
mixture of dosely related oligonudeotides sharing an identical sequence, except at the ambrguous 

20 positions. Such an oligonudeotide is usually synthesized as a mixture of at) possible combinations of 
nudeotides at the ambiguous positions. Each of the digonudeotides in the mixture is referred to as an 
"alternative form*. The number of forms in the mixture is equal to 

n 

25 n ki 



where is the number of alternative nudeotides allowed at each position. 

As used herein, a "consensus' oligonudeotide sequence is a designed sequence derived from at 
30 least two related originating polynudeotide sequences as follows: the residues that are conserved in all 
originating sequences are preserved in the consensus sequence; while at positions where residues are not 
conserved, one alternative is chosen from amongst the originating sequer^es. In gerieral, the nucleotide 
chosen is the one v^rhich occurs In the greatest frequency in the originating sequerKes. For example, the 
consensus sequence AAAAA may be designed from originating sequences CAAAA, AAGAA. and AAAAT. 
35 A consensus segment is a segment of a polynudeotide containing a consensus sequence. 

A polynudeotide fragment' or 'insert* as used herein generally represents a sub-region of ttie 
fulMength form, t>ut the entire fulMength polynucleotide may also be included. 

Polynudeotides "correspond' to each other rf they are believed to be derived from each other or 
from a common ancestor. For example, encoding regions in the genes of different vimses correspond if 
40 they share a significant degree of identity, map to the same location of the genome, or encode proteins 
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that perform a similar biochemical function. Messenger RNA corresponds to the gene from which it is 
transcribed. cDNA corresponds to the RNA from which it has been produced, and to the gene that 
encodes the RNA. A protein corresponds to a polynucleotide encoding it, and to an antibody that is 
capable of binding it specificatly. 
5 A 'probe' when used in the context of polynucleotide manipulation refers to an oligonucleotide 

which Is provided as a reagent to detect a target potentially present in a sample of irnerest by hybridizing 
with the target. Usually, a probe will comprise a label or a means by wtiich a label can be attached, eitiner 
before or subsequent to the hybridization reaction. Suitable labels iridude, but are not limited to 
radioisotopes, fluorochromes, chemiluminescent compounds, dyes, and proteins, tncJuding enzymes. 

10 A "primer* is an oligonucleotide, generally with a free 3' -OH group, that binds to a target 

potentially present in a sample of interest by hybridizing with the target and thereafter promotes 
polymerization of a polynucleotide complementary to the target. 

Processes of producing replicate copies of the same polynucleotide, such as PCR or gene 
cloning, are collectively referred to herein as "amplification* or "replication". For example, single or dout>le- 

15 stranded DNA may be replicated to form another DNA with the same sequence. RNA may be replicated, 
for example, by an RNA-directed RNA polymerase, or by reverse-transcritjing the DNA and then 
perfomning a PCR. In the latter case, the amplified copy of the RNA is a ONA with the identical sequence. 

A "polymerase chain reaction' f PCR") is a reaction in which replicate copies are made of a target 
polynucleotide using one or more primers, and a catalyst of polymerization, such as a reverse 

20 transcriptase or a DNA polymerase, and particutariy a themiatly stable polymerase enzyme. Generally, a 
PCR involves rerteratively performirig three steps; "annealing", in which the temperature is adjusted such 
that oligonucleotide primers are permitted to form a duplex with the polynucleotide to be amplified; 
"elongating", in which Uie temperatijre is adjusted such that oligonucleotides that have fonned a duplex are 
elongated with a DNA polymerase, using the polynucleotide to which they've formed the duplex as a 

25 template; and "melting", in which the temperature is adjusted such that the polynucleotide and elongated 
oligonucleotides dissociate. The cyde is then repeated until the desired amount of amplified 
polynucleotide is obtained. Metiv>ds for PCR are taught in U.S. Patent Nos. 4,683.195 (Mullis) and 
4,683,202 (Multiset al.). 

A 'control element" or "control sequence' is a nucleotide sequence involved in an interaction of 
30 molecules that contributes to the functional regulation of a polynucleotide, including replication, duplication, 
transcription, splicing, translation, or degradation of the polynucleotide. The regulation may affect the 
frequency, speed, or specfficity of the process, and may be enhancing or inhit>itory in r\ature. Control 
elements are known in the art. For example, a 'promoter' is an example of a control element. A promoter 
is a DNA region capable under certain conditions of binding RNA polymerase and initiating transcription of 
35 a coding region located downstream {in the 3' direction) from the promoter. "Operatively linked" refere 
to a juxtaposition of genetic elements, wtierein the elements are in a relationship pemiitting them to 
operate in the expected manner. For instance, a promoter is operatively linked to a coding region if the 
promoter helps initiate transcription of the coding sequence. There may be inten/ening residues between 
the promoter and coding region so long as this functional relationship is maintained. 
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The terms •polypeptide', 'peptide' and "protein" are used interchangeably herein to refer to 
polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified 
amino acids, and it may be intemjpted by mn-amino acids. The temis also encompass an amino acid 
polymer that has been modified naturally or by intervention; for example, disulfide bond formation, 
5 giycosylation. lipidation, acetyiation, phosphorylation, or any other manipulation, such as conjugation with 
a latwiing component. 

In the context of polypeptides, a "linear sequence* or a "sequence' is an order of amino acids in a 
polypeptide in an N-terminal to C-temiina) direction in which residues that neighbor each other in the 
sequence are contiguous in the primary stnx:ture of the polypeptide. A 'partial sequence" is a linear 

1 0 sequence of part of a polypeptide which is kr>own to comprise additional residues in one or both directions. 

A linear sequence of amino adds is "essentially identical' to another sequence rf the two 
sequerx:es have a substantial degree of sequence identity. It is understood that the folding and the 
biochemical function of proteins can accommodate insertions, deletions, and sut^stitutions in the amtr>o 
add sequence. Thus, linear sequences of amino adds can be essentially Identical even if some of the 

1 5 residues do not predsely correspond or align. Sequences tt^t comespond or align more closely to tiie 
invention disdosed herein are more preferred. It is also understood that some amino add substitutions are 
more easily tolerated. For example, substitution of an amino add with hydrophobic side chains, aromatic 
side chains, polar side chains, side chains with a positive or negative charge, or side chains comprising 
two or fewer carbon atoms, by another amino add wvith a side chain of like properties can occur without 

20 disturbing the essential identity of the two sequences. Methods for determining homologous regions and 
scoring the degree of homology are well known in the art; see for example Aftschul et at. and Henikoff et al. 
Well-tolerated sequence differences are refen-ed to as 'conservative substitutions'. Thus, sequerwes with 
conservative substitutions are preferred over those with other substitutions in the same positions; 
sequerKes with identical residues at the same positions are still mors preferred. Generally, a 

25 polypeptide region will be essentially identical to another region, after alignment of homologous portions, if 
the sequences are at least about 92% iderrtical; mare preferably, tiiey are at least about 95% iderrtical; 
more preferably, they are at least about 95% identical and comprise at least another 2% which are either 
identical or are conservative substitutions; more preferably, they are at least at>out 97% identical; more 
preferably, they are at least about 97% ktentical, and comprise at least another 2% which are either 

30 identical or are conservative substitutions: more preferably, they are at least about 99% identical; still more 
preferably, the sequerx^es are 100% identical. 

In determining wtiether polypeptide sequences are essentially identical, a sequer>ce that 
preserves the functionality of the polypeptide with which it is being compared is particulariy preferred. 
Functionality may be established by different parameters, such as enzymatic activity, the binding rate or 

35 affinity in a sut>strate-enzyme or receptor-ligand interaction, ttie binding affinity with an antitxxjy. and X-ray 
crystallographtc structure. 

A polypeptide has the same 'characteristics' of another x'yp^ptide if it displays the same 
tMochemical function, such as enzyme activity, ligand binding, or antitxxfy reactivity Preferred 
characteristics of a polypeptide related to a Glycoprotein B or a Glycoprotein B fragment are the ability to 
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bind analogs of the cell surface receptor bound by Glycoprotein B of other herpes spedes, the ability to 
promote membrane fusion with a target cell, the ability to promote viral penetration of the host cell. Alsc 
preferred is a polypeptide that displays the same biochemical function as the polypeptide with wt^ch it rs 
being compared, arxl in addition, is believed to have a similar three-dimensional confomiation, as 
5 predicted by computer modeling or determined by such techniques as X-ray crystallography. 

The "biochemical function', "biological function" or "biological activity" of a polypeptide includes 
any feature of the polypeptide detectable by suitable experimental investigation. 'Altered* biochemical 
function can refer to a change in the primary, secondary, tertiary, or quatennary structure of the 
polypeptide; detectable, for example, by molecular weight determination, circular dichroism, antitody 

10 binding, difference spectroscopy, or nudear magnetic resonance. It can also refer to a change in 
reactivity, such as the ability to catalyze a certain reaction, or the ability to bind a co^ctor, substrate, 
inhibitor, drug, hapten, or other polypeptide. A substance may be said to "interfere" with the biochemical 
function of a polypeptide if it alters the biochemical fur>ction of the polypeptide in any of these ways. 

A "fusion polypeptide' is a polypeptide comprising regions in a different position in the sequence 

15 than occurs in nature. The regions may normally exist In separate proteins and are brought together in the 
fusion polypeptide; or they may normally exist in the same protein but are placed in a new arrangement in 
the fusion polypeptide. A fusion polypeptide may be created, for example, by chemical synthesis, or by 
creating and translating a polynudeotide in which the peptide regions are encoded In the desired 
relationship. 

20 An 'antibody' (interchangeably used in plural form) is an immurK>globutin molecule capable of 

specific binding to a target, such as a polypeptide, through at least one arrtigen recognition site, located in 
the variable region of the immunoglobulin molecule. As used herein, the term encompasses not only intact 
antibodies, but also fragments tt>ereof. mutants thereof, fusion proteins, humanized antibodies, and any 
other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of 

25 the required specificity. 

'Immunological recognition* or "immunological reactivity' refers to the specific binding of a target 
through at least one antigen recogr^ion site in an immunogtobuiin or a related molecule, such as a B cell 
receptor or a T cell receptor. 

The temr) 'antigen' refers to the target molecule that is spedfically bound by an antibody through 

30 its antigen recognition site. The antigen may. but need not be chemically related to the immunogen that 
stimulated production of the antitxxly. The antigen may be polyvalent, or it may be a monovalent hapten. 
Examples of kinds of antigens that can t>e recognized by antitxxlies indude polypeptides, polynudeotides. 
other antibody molecules, oligosaccharides, complex lipids, drugs, and chemicals. 

An "immunogen' Is a compound capable of stimulating production of an antibody when injected 

35 into a suitable host, usually a mammal. Compounds with this property are described as ~tmmur>ogenic'. 
Compounds may be rendered immunogenic by many techniques known in tiie art. induding crossltnking or 
conjugatir>g with a carrier to irKTease valency, mixing with a mitogen to increase the immune response, 
arKj combining with an adjuvant to enhance presentation. 
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A 'vaccine' is a pharmaoeutical preparation for human or animal use, which is administered with 
the intention of confening the recipient with a degree of specific immunological reactivity against a 
particular target, or group of targets. The immurK>logical reactivity may be antitxxlies or cells (particularly 
B cells, plasma cells, T helper cells, and cytotoxic T lymphocytes, and their precursors) that are 
5 immunologically reactive against the target, or any comt>ination thereof. Possil>le targets include foreign or 
pathological compounds, such as an exogenous protein, a pathogenic vims, or an antigen expressed by a 
cancer cell. The immunological reactivity may be desired for experimental purposes, for the treatment of a 
particular condition, for the elimination of a particular substance, or for prophylaxis against a particular 
condition or substance. Unless specifically indicated, a vaccine referred to herein may be either a passive 

1 0 vaccine or an active vaccine, or it may have the properties of both. 

A "passive vaccine' is a vaccine that does not require participation of the recipient's immune 
response to exert its effect. Usually, it is comprised of antitKxJy molecules reactive against the target. The 
antibodies may be obtained from a donor subject and sufficiently purified for administration to the recipient, 
or they may be produced in vitro, for example, from a culture of hyt>ridoma cells, or by genetically 

1 5 engineering a polynucleotide erxwding an antitxxJy molecule 

An 'active vacdr>e' is a vaccine administered with the intention of eliciting a specific immune 
response within the recipient, that in turn has the desired immunological reactivity against the target. An 
active vaccine comprises a suitable immunogen. The immune response that is desired may be either 
humoral or cellular, systemic or secretory, or any combination of these. 

20 A 'reagent' potynudeotide. polypeptide, or antibody, is a substance provided for a reaction, the 

substance having some known and desirable parameters for the reaction. 

A reaction mixture may also contain a 'target', such as a polynucleotide, antibody, or polypeptide 
Uiat the reagent Is capable of reactir>g with. For example, in some types of diagrrastic tests, the amount of 
the target in a sample is determined by addir^ a reagent, allowing the reagent and target to react, and 

25 measuring the amount of reaction product. In the context of clinical management, a target may also t>e a 
cell, collection of cells, tissue, or organ that Is the object of an administered substance, such as a 
phamnaceutical compound. A cell ttiat is a target for a viral infection is one to which a virus preferentially 
localizes for such purposes as replication or transformation into a latent form. 

An 'isolated* polynucleotide, polypeptide, protein, antibody, or otfier substance refers to a 

30 preparation of the substance devoid of at least some of the other components that may also be present 
wt>ere the substance or a similar substance naturally occurs or is initially obtained from. Thus, for example, 
an isolated substance may be prepared by using a purification technique to enrich it from a source mixture. 
Enrichment can be measured on an absolute t>asis. such as weight per volume of solution, or it can be 
measured in relation to a second, potentially interfering substance present in the source mixture. 

35 Increasing enrichments of the emtxxliments of this invention are increasingly more preferred. Thus, for 
example, a 2-fold enrichment is prefen^d, 10-fold enrichment is more preferred, 100-fold enrichment is 
more preferred, lOOO-fold enrichnwit is even more preferred. A substance can also t>e provided in an 
isolated state by a process of artificial assembly, such as by chemical synthesis or recombinant 
expression. 

^1 
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A potynucleotide used in a reaction, such as a probe used in a hybridization reaction, a primer 
used in a PGR, or a potynucJeotide present in a pharmaceutical preparation, is referred to as 'specific* or 
'selective' if it hybridizes or reacts with the intended target more frequently, more rapidly, or with greater 
duration than it does with alternative substances. Similariy. a polypeptide is referred to as "specific" or 
5 'selective* if it binds an intended target, such as a iigand, hapten, substrate, antibody, or other polypeptide 
more frequently, more rapidly, or with greater duration ttian it does to altemative substances. An antibody 
is refenBd to as 'specific' or 'selective' if it binds via at least or>e antigen recognition site to the intended 
target more frequently, more rapidly, or with greater duration than it does to altemative substances. A 
polynudeotide. polypeptide, or antitxxly is said to 'selectively inhibiT or "selectively interfere with" a 

1 0 reaction If it inhibits or interferes with the reaction t}etween particular substrates to a greater degree or for a 
greater duration than it does with the reaction between altemative substrates. 

A "phannaceutical carKlidate" or "drug candidate' is a compwund twiieved to have therapeutic 
potential, that is to be tested for efficacy. The 'screening' of a pharmaceutical carididate refers to 
conducting an assay that is capable of evaluating the efficacy and/or specificity of the candidate. In this 

1 5 context, 'efficacy' refers to the ability of the candidate to affect tiie cell or organism rt is administered to in a 
beneficial way. for example, the limitation of the pathology due to an invasive virus. 

The 'effector component" of a phamiaceutical preparation is a component which modifies target 
cells by altering their function in a desirable way when administered to a subject bearing the cells. Some 
advanced pharmaceutical preparations also have a "targeting component', such as an antitxxiy, which 

20 helps deliver the efFector component more efficaciously to the target site. Depending on the desired action, 
the effector component may have any one of a number of modes of action. For example, it may restore or 
enhance a nomnal fur>ction of a cell, it may eliminate or suppress an abnomnal function of a cell, or it may 
alter a cell's phenotype. Alternatively, it may kill or render dormant a cell with pathological features, such 
as a virally infected cell. Examples of efFector components are provided in a later section. 

25 A "cell tirw' or "cell culture" denotes higher eukaryotic cells grown or maintained in vitro. It is 

understood that the descendants of a cell may not be completely identical {either morphologically, 
genotypically, or phenotypically) to the parent cell. 

A "host cell" is a cell which has t>een transfomfied, or is capable of t>eing transformed, by 
administration of an exogenous polynucleotide. A "host cell* includes progeny of the original transformant. 

30 'Genetic alteration' refers to a process wtwrein a genetic element is introduced into a cell other 

than by natural cell division. The element may be heterologous to the cell, or it may be an additional copy 
or improved version of an element already present in tt\e cell. Genetic alteration may be effected, for 
example, by transfectir^g a cell with a recombinant plasmid or other polynudeotide through any process 
known in the art. such as electroporation, calcium phosphate precipitation, contacting with a 

35 polynucleotide-liposome complex, or by transduction or infection with a DMA or RIMA viais or viral vector. 
The alteration is preferably txjt not necessarily inheritable by progeny of the altered cell. 

An "individual" refers to vertebrates, particularty members of a mammalian species, and indudes 
but is not limited to domestic animals, sports animals, and primates, induding humans. 
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The term "primate' as used herein refers to any member of the highest order of mammalian 
species. This includes (but is not limited to) prosimians, such as lemurs and torises; tarsioids, such as 
tarsiers; new-world monkeys, such as squirrel monkeys (Saimiri sdureus) and tamarins; old-world 
monkeys such as macaques (including Macaca nemestnna. Macaca fasaculahs, and Macaca fuscata)\ 
5 hylobatids. such as gibbons and siamangs; pongids, such as orangutans, gorillas, and chimparuees; and 
hominlds, Including humans. 

The 'pathology* caused by a herpes virus infection is anything that compromises the well-being or 
normal physiology of the host. This may involve (but is not limited to) destnjctive invasion of the virus into 
previously uninfected cells, replication of the virus at the expense of the normal metabolism of the celt. 
1 0 generation of toxins or other unnatural molecules by the vinjs. irregular growth of cells or Intercellular 
structures (including fibrosis). in%gular or suppressed tMological activity of infected cells, malignant 
transformation, interference with the normal function of r>eighboring cells, aggravation or suppression of an 
inflammatory or immunological response, and increased susceptibility to other pathogenic organisms and 
conditions. 

15 Treatment' of an individual or a cell is any type of intervention in an attempt to alter the natural 

course of the individual or cell. For example, treatment of an Individual may be undertaken to decrease or 
limit the pathology caused by a herpes virus infecting the individual. Treatment includes (but is not limited 
to) administration of a composition, such as a pharmaceutical composition, and may toe performed either 
prophylactically, or therapeutically, subsequent to the initiation of a pathologic event or contact with an 

20 etiologic agent. 

It is understood that a clinical or biological 'sample* encompasses a variety of sample types 
obtained from a subject and useful in an in vitiv procedure, such as a diagnostic test. The definition 
encompasses solid tissue samples obtained as a surgical removal, a pathology specimen, or a biopsy 
specimen, tissue cultures or cells derived therefrom and the progeny thereof, and sections or smears 

25 prepared from any of these sources. Non-limrtirvg examples are samples obtained from infected sites, 
fibrotic sites, unaffected sites, and tumors. The definition also erKX)mpasses blood, spinal fluid, and other 
liquid samples of biologic origin, arvj may refer to either the cells or cell fragments suspended therein, or to 
the liquid medium and its solutes. The definition also includes samples that have t>een solubilized or 
enriched for certain components, such as OUA, RNA, protein, or antibody. 

30 Oligonucleotide primers and prot>es descrit>ed herein have t>een named as follows: The first part 

of the designation is the single amino acid code for a portion of the conserved region of the polypeptide 
they are based upon, usually 4 residues long. This is followed with the letter A or B, indicating respectively 
that the oligonucleotide is complementary to the sense or anti-sense sti^nd of the encoding region. 
Secorxlary consensus oligonucleotides used for sequencing and labeling reactions have the letters SQ at 

35 the end of tiie designation. 



^3 



wo 97/12042 



PCT/US96/15702 



GonormI techniques 

The practice of the present invention will employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, recombinant DNA, and immunology, which are within the 
5 skill of the art. Such techniques are explained fully in the literature. See, for example, 'Molecular Cloning; 
A Laboratory Manual', Second Edition (Sambrook, Fritsch & Maniatis, 1989), 'Oligonucleotide Synthesis' 
(M.J. Gait, ed.. 1984). 'Animal Cell Culture' (R.I. Freshney. ed., 1987): the series 'Methods in Enzymotogy" 
(Academic Press. Inc.); 'Harxlbook of Experimental Immunology' (D.M. Weir & C.C. Btackwell, eds.), 
'Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & M P. Calos, eds.. 1987). 'Current Protocols in 
10 Molecular Biology' (F.M. Ausubei et at., eds., 1987); and 'Cunem Protocols in Immunology' (J.E. Coligan 
etal.,eds., 1991). 

All patents, patent applications, articles and publications mentioned herein, both supra and infra, 
are hereby Incorporated herein by reference. 

1 5 Po/ynuc/aotJdes encoding Giycoprotein B of the herpes vinjs RPHV/KSHV subfamily 

This invention embodies Isolated potynudeotide segments derived from Glycoprotein B genes 
present in herpes viruses that erKOde a fragment of a Glycoprotein B polypeptide. The polyr\ucleotides 
are related to the RFHV/KSHV subfamily of herpes viruses. Exemplary polynucleotides ervcode 

20 Glycoprotein B fragments from either RFHV or KSHV. Preferred fragments irwiude those shown in Figure 
1 , and subfragments thereof, obtained as described in the Example section below. Especially prefened is 
the potynudeotide comprising the sequer>ce twtween residues 36-364 of SEQ. ID N0:1 , SEQ. ID N0:3, or 
SEQ. ID NO:96. or pdynucieotides contained in SEQ. ID NO:92. 

The potynudeotide segments of RFHV and KSHV between residues 36 and 354 are 76% identical. 

25 Shared residues are irxjicated in Figure 1 by '*'. The lorigest subregions that are identically shared 
between RFHV and KSHV within this segment are 15. 17. and 20 nudeotides in length. 

The 319 base pair fragments of RFHV and KSHV between the amplification primer binding sites are 
more identical to each other than either of them are to that of any previously sequenced herpes virus. The 
next most dosely related sequer^s are sHVI arnj bHV4, which are 63% identical to the corresponding 

30 sequence of KSHV. and 60% identical to the corresponding sequence of RFHV. The longest number of 
consecutive bases shared between the Glycoprotein B fragment and any of the previously sequenced 
viruses is 14. It is believed that any subfragment of the RFHV or KSHV sequence of 16 base pairs or 
longer wilt be unique to the RFI-(V/KSHV subfamily, or to particular herpes virus spedes and variants 
within the subfamily. 

35 This invention embodies subfragments contained in the Glycoprotein B gene of the RFHV/KSHV 

subfamily, preferably contained in the region corresponding to the 319 base pair fragment between 
residues 36-354 of SEQ. ID NO:1. SEQ. ID N0:3. or SEQ. ID NO:96. or anywhere in SEQ. ID NO;92. 
Preferably, the sub-fragments are at least about 16 nucleotides in length: more preferably they are at least 
18 nudeotides in length; more preferably they are at least 21 nudeotides in lengtin: more preferably they 

<3Y 
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are at least about 25 nudeotides in length; more preferably they are at least about 35 nucleotides in length; 
still more preferably they are at least about 50 nucleotides in length; yet more preferably they are at least 
about 75 nudeotides in length, and even more preferably they are 100 nudeotides in length or more. Also 
embodied in this invention are polynudeotkies comprising the entire open reading frame of each 
5 respective herpes viais Glycoprotein B. 

The RFHV/KSHV subfamily consists of members that have sequences that are more dosely 
identical to the corresponding sequences of RFHV or KSHV, than RFHV or KSHV are to any other virus 
listed in Table 1 . Preferred members of the family may be identified on the basis of the sequence of the 
Glycoprotein B gene in the region corresponding to that of Figure 1. Table 2 provides the degree of 
1 0 sequer>ce identities in this region: 



1 TABLE 2: Sequence Identtties Between Glycoprotein B of KSHV and other Herpes Viruses 






Identity to polynucleotide fragntent: 


Glycoprotein B 
Sequertce 


ScQ. ID NO: 


RFHV 
(SEa ID NO:1) 
Bases 36-354 


KSHV 
(SEa ID NO;3) 
Bases 36-354 


RFHV 


1 


(100%) 


76% 


RFHV/KSHV 
subfamily 








KSHV 


3 


76% 


(100%) 


Other gamma sHV1 
herpes vimses 


S 


60% 


63% j 


bHV4 


6 


60% 


63% 


eHV2 


7 


52% 


54% 1 


mHVBB 


8 


56% 


54% 1 


hEBV 


9 


<50% 


52% j 


— 

alpha and beta hCMV 
herpes viruses 


10 1 


< 50% 


<50% 


hHVB 


11 


<50% 


< 50% 


hVZV 


12 


<50% 


< 50% 


HSV1 


13 j 


<50% 


<50% 



The percentage of sequence identity is calculated by first aligning the encoded amino add 
15 sequence, determining the corresponding alignment of the encoding polynudeotide, and then counting 
the number of residues shared between ttw sequences t>eing compared at each aligned position. No 
penalty is imposed for the presence of insertions or deletions, but insertions or deletions are permitted 
only where required to accommodate an obviously increased number of amino acid residues in one of 
the sequences t>eing aligned. Offsettirtg insertions just to improve sequence alignment are not 
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permitted at either the polypeptide or polynucleotide level. Thus, ar^y insertions in the polynucleotide 
sequence will have a length which is a multiple of 3. The percentage is given in terms of residues in the 
test sequence that are identical to residues In the comparison or reference sequence. 

Preferred Glycoprotein B encoding polynucleotide sequences of this invention are those derived 
5 from the RFHV/KSHV herpes virus subfamily. They include those sequences that are at least 65% 
identical with the RFHV or KSHV sequence between bases 36 and 354; more preferably, the 
sequences are at least 67% identical; more preferably, the sequences are at least about 70% identical; 
more preferably, the sequences are at least about 75% identical; more preferably, the sequences are at 
least atxjut 80% identical; more preferably, the sequences are at least about 85% identical; more 

10 preferably, the sequences are at least at»ut 90% identical; even more preferably, the sequences are 
over 95% identical. Also included are Glycoprotein B encoding regions that are upsti-eam or 
downstream of a region fulfilling the identity criteria indicated. 

Other preferred Glycoprotein B encoding polynucleotide sequences may be identified by the 
percent identity with RFHV/KSHV subfamily-specific oligonucleotides (Type 2 oligonucleotides) 

1 5 described in more detail in a later section. The percent identity of RFHV and KSHV Glycoprotein B with 
exemplary Type 2 oligonucleotides is shown in Table 3: 



TABLE 3: Sequence ld«ntttte« Between Gtycoprotetn B of Select Herpes Viruses and 
RFHV/KSHV Subfwniry Spectnc Oligonucleotides 


Glycoprotein B 
Sequence 


SEQ. ID 
NO: 


Identity to 
SHMDA 

(SEa ID 
NO:41) 


Identity to 
CFSSB 

(SEa ID 
N0^43) 


Identity to 
EKTFA 

(SEa ID 
NO:4£) 


identity to 
DNtQB 

(SEa ID 
NO:46} 


RFHV 


1 


91% 


91% 


89% 


91% 


KSHV 


3 


100% 


85% 


89% 


97% 


sHV1 


5 


71% 


70% 


66% 


66% 


bHV4 


6 


57% 


64% 


69% 


74% 


eHV2 


7 


57% 


61% 


54% 


60% 


mHVBB 


8 


<50% 


55% 


54% 


77% 


hEBV 


9 


57% 


55% 


60% 


51% 


hCMV 


10 


57% 


55% 


60% 


51% 


hHV6 


11 


<50% 


52% 


60% 


57% 


hVZV 


12 


54% 


58% 


66% 


57% 


HSVl 


13 


57% 


60% 


54% 


54% 



Percent identity is calculated for oligonucleotides of this length by not allowing gaps in either the 
20 oligonucleotide or the polypeptide for purposes of alignment. Throughout this disclosure, whenever at 
least one of two sequences t>eing compared is a degenerate oligonucleotide comprising an ambiguous 
residue, the two sequences are identical if at least one of the alternative forms of the degenerate 
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oligonucleotide is identical to the sequence with which it is being compared. As an ittustration, AY AAA 
is 1 00% identical to ATAAA. since AYAAA is a mixture of ATAAA and ACAAA. 

Preferred Glycoprotein B encoding sequences are those which over the corresponding region 
are at least 72% identical to SHMDA; more preferably they are at least 74% identical: more preferably 
5 they are at least about 77% identical; more preferably they are at least about 80% (dentlcat; more 
preferably they are at least about 85% identical; more preferably they are at least about 91% identtcal. 
Other preferred Glycoprotein B er>coding sequences are those which over the corresponding region are 
at least 71% identical to CFSSB; more preferably they are at least 73% identical: more preferably they 
are at least about 77% identical; more preferably they are at least about 80% identica); more preferably 

1 0 they are at least about 85% identical. Other preferred Glycoprotein B encoding sequences are those 
which over the corresponding region are at least 70% identical to ENTFA; more preferably they are at 
least 72% identical; more preferably they are at least about 75% identical; more preferabty they are at 
least about 80% identical: more preferabty they are at least about 85% identical; even more preferably, 
they are at least about 69% identical. Other prefen^d Glycoprotein B encoding sequences are those 

1 5 which over the con-esponding region are at least about 78% identical to DNIQB; more preferably they 
are at least 80% identical; more preferabty they are at least about 85% identical: more preferabty they 
are at least about 91% identical. Also included are Glycoprotein B encoding regions that are upstream 
or downstream of a region fulfilling the identity criteria indicated. 

Glycoprotein B encoding sequences from memt>ers of the RFHV/KSHV subfamily identified by 

20 any of the aforementioned sequence comparisons, using either RFHV or KSHV sequences, or the 
subfamily- specific oligonucleotides, are equally preferred. Exemplary sequences are the Glycoprotein 6 
encoding sequences of RFHV and KSHV. Also embodied in this invention are fragments of any 
Glycoprotein B encoding sequences of the subfamily, and longer polynucleotides comprising such 
polynucleotide fragments. 

25 The polynucleotide sequences described in this section provide a basis for obtaining the 

synthetic oligonucleotides, proteins and antibodies outlined in the sections that follow. These 
compounds may be prepared by standard techniques known to a practitioner of ordinary skill in the art. 
and may be used for a number of investigative, diagnostic, and therapeutic purposes, as described 
below. 

30 

Preparation of potynucleotides 

Polynucieotides and oligonucleotides of this invention may be prepared by any suitable method 
known in the art. For example, otigonudeotide primers can be used In a PGR ampfificatjon of ONA 
35 obtained from herpes vims infected tissue, as in Example 3 and Example 11, described twlow. 
ARematively. oligonucleotides can be used to identify suitable bacterial clones of a DHA library, as 
described below in Example 8. 

Polynucleotides may also t>e prepared directly from the sequence provided herein by chemical 
synthesis. Several methods of synthesis are krxswn in the art includir>g the triester method and the 
40 phosphite method. In a preferred method, polynucleotides are prepared by solid-phase synthesis using 

37 
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mononucleoside phosphoramidite coupling units. See, for example Horise et al., Beaucage et al., Kumar 
et ai.. and U.S. Patent No. 4.415.732. 

A typical solid-phase synthesis involves reiterating four steps: deprotection. coupling, capping, and 
oxidation. This results in the stepwise synthesis of an oligonucleotide in the 3' to 5' direction. 
5 In the first step, the growing oligonucieotide. which Is attached at the 3'-end via a (-0-} group to a 

solid support, is deprotected at the 5" end. For example, the 5' end may be protected by a -ODMT group, 
formed by reacting with 4,4'Kiimethoxytrityt chloride (DMT-CI) in pyhdtne. This group is stable under basic 
conditions, but is easily removed under acid conditions, for example, in the presence of dichloroacetic acid 
(DCA) or trichloroacetic acid (TCA). Deprotedion provides a 5' -OH reactive group. 
10 In the second step, the oligonucleotide is reacted with the desired nucleotide monomer, which itself 

has first been converted to a 6'-protected. 3'-phosphoramidite. The 5' -OH of the monomer may be 
protected, for example, in the form of a -ODMT group, and the 3 -OH group may t>e converted to a 
phosphoramidite, such as -OP(0R')NR2: wtiere R is the isopropyl group -CHiCH^^, a"*^ R' 'S. for example. 
-H (yielding a phosphoramidite diester). or -CHa. -CH2CH3. or the beta-cyanoethyl group -CHjCHjCN 
1 5 (yielding a phosphoramidite triester). The 3'-phosphoramiditB group of the monomer reacts with the 5' -OH 
group of the growing oligonucleotide to yield the phosphite linkage 5'-0P(0R')0-3'. 

In the third step, oligonucleotides that have not coupled with the monomer are withdrawn from 
further synthesis to prevent tf>e formation of Incomplete polymers. This is achieved by capping the 
remaining 5' -OH groups, for example, in the form of acetates (-0C(0}CH3.} by reaction with acetic 
20 anhydride {CH3C{0>-0-C{0)CH3). 

In the fourth step, the newly formed phosphite group (i.e., 5*-OP(OR')0-3') is oxidized to a 
phosphate group (i.e., 5'-0P(=0)(0R')0-3'): for example, by reaction with aqueous iodine and pyridine. 

The four-step process may then be reiterated, since the oligonucleotide obtained at the end of the 
process is 5'-protected and is ready for use in step one. When the desired fulHength oligonucleotide has 
25 been obtained, it may be deaved from the solid support, for example, by treatment with alkali and heat. 
This step may also serve to convert phosphate triesters (i.e.. when R' is not -H) to the phosphate diesters 
(-0P(=0)20-). and to deprotect base-labile protected amirw groups of the nucleotide bases. 

Potynucieotides prepared by any of these methods can be replicated to provide a larger supply by 
any standard technique, such as PGR amplification or gene cloning. 

30 

Cloning and expression vectors comprising a Gtycoprotetn B encoding polynucleotide 

Clonir>g vectors and expression vectors are provided in this invention that comprise a sequence 
encoding a herpes virus Glycoprotein B or variant or fragment thereof. Suitable cloning vectors may be 
35 constructed according to star>dard techniques, or may be selected from the large numt>er of cloning 
vectors available in the art. While the cloning vector selected may vary according to the host cell intended 
to be used, useful cloning vectors will generally have the ability to self-replicate, may possess a single 
target for a particular resection endonuclease. and may carry genes for a marker that can be used in 
selecting transfected clones. Suitable examples include plasmids and bacterial vimses; e.g.. pUC18, 
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mp18, mp19, pBR322, pMB9, CoiE1 , pCR1, RP4, phage DNAs, and shuttle vectors like pSA3 and pAT26. 

Expression vectors generally are replicabte polynucleotide constructs that encode a polypeptide 
operatively linked to suitable transcriptional and translational controlling elements. Examples of 
5 transcriptional controlling elements are promoters, enhancers, transcription initiation sites, and 
transcription tenninatlon srtes. Examples of translational controlling elements are ritx^some binding sites, 
translation initiation sites, and stop codons. Protein processing elements may also be Included: for 
example, regions that encode leader or signal peptides and protease cleavage sites required for 
translocation of tt>e polypeptide across the membrane or secretion from the ceil. The elements employed 
1 0 would be functional in the host cell used for expression. The controlling elements may be derived from the 
same Glycoprotein B gene used In the vector, or they may be heterologous (i.e.. derived from otfier genes 
ar>d/or other organisms). 

Polynucleotides may t>e inserted Into host cells by any means krx>wn in the art. Suitable host cells 

include bacterial cells such as E. coii, mycobacteria, other prokaryotic microorganisms and eukaryotic cells 
1 5 (including fungal cells, insect cells, plant cells, and animal cells). The celts are transformed by inserting the 

exogenous polynucleotide by direct uptake, endocytosis. transfection. f-mating. or electroporation. 

Subsequently, the exogenous polynucleotide may be maintained within the cell as a norvintegrated vector, 

such as a plasmld. or may alternatively be integrated into the host cell genome. 

Cloning vectors may t>e used to obtain replicate copies of the potynucteotides they contain, or as a 
20 means of storing the polynucleotides In a depository for future recovery. Expression vectors and host cells 

may be used to obtain polypeptides transcribed by the polynudeotides they contain. They may also be 

used in assays where it is desirable to have intact cells capable of synthesizing the polypeptide, such as in 

3 drug screening assay. 

25 Synthetic Type 1 oiigonucleotidBS for Glycoprotein B of gamma herpes virus 

Oligonucleotides designed from sequences of herpes vims Glycoprotein B. as embodied In this 
invention, can be used as probes to identify related sequences, or as primers in an amplification reaction 
such as a PGR. 

30 Different oligonucleotides with different properties are described in the sections that follow. 

Oligonucleotides designated as Type 1 are designed from previously known gamma herpes virus 
Glycoprotein B polynucleotide sequences. They are designed to hyt>ridlze with polynucleotides encoding 
any gamma herpes virus Glycoprotein B. and may be used to detect previously known species of gamma 
herpes virus. They may also be used to detect and characterize new species of gamma herpes virus. 

35 Oligonucleotides designated as Type 2 are designed from the RFHV and KSHV Glycoprotein B 
polynucleotide sequertces togettier. They are designed to hybridize with polynucleotides encoding 
Glycoprotein B of the RFHV/KSHV sutjfamily, including but not limited to RFHV and KSHV. 
Oligonucleotides designated as Type 3 are designed from RFHV or KSHV Glycoprotein sequences that 
are relatively unique to the individual virus. They are designed to hybridize specifically witii 

40 polynucleotides encodir>g Glycoprotein B only from RFHV or KSHV and closely related viral strains. 

^9 
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Some preferred examples of Type 1 oligonucleotides are listed in Table 4. These oligonudeotides 
have a specificity for Glycoprotein B ertcodirtg polynucleotides of a broad range of herpes viruses. 



TABLE 4: Type 1 Oltgonudeotides used for Detecting, Amplifying, or Characterizing Herpes Virus 

Polynucleotldss encoding Glycoprotein B 


Target: Herpes Giyooprotsin B, especially from gamma Herpesviruses 


Deeig- 

1 nation 


Sequence 

(S'toT) 


Length 


No. of 
forms 


Orierv 
tstion 


SEQ 

ID: 


FRFOA 


GCTGTTCAGATTTGACTTAGAYMANMCNTGYCC 


33 


256 


sense 


13 


NIVPA 


GTGTACAAGAAGAACATCGTGCCNTAYATNrTTYA 
A 


32 


64 


sense 


14 


NIVPASQ 


GTGTACAAGAAGAACATCGTGCC 


23 


1 




15 


TVMCB 


AACATGTCTACAATCTCACARTTNACNGTNGT 


32 


126 


anti- 
sense 


16 


TVNCBSQ 


AACATGTCTACAATCTCACA 


20 


1 




17 


FAYDA 


AATAACCTCTTTACGGCCCAAATTCARTWYGCN 
TAYGA 


38 


64 


sense 


18 


lYGKA 


CCAACGAGTGTGATGTCAGCCATTTAYGGNAAR 
CCNGT 


38 


64 


sense 


19 


lYGKASQ 


CCAACGAGTGTGATGTCAGCC 


21 


1 




20 


CYSRA 


TGCTACTCGCGACCTCTAGTCACCTTYAARTTYR 

TKIAA 


38 


64 


sense 


21 


CYSRASQ 


TGCTACTCGCGACCTCTAGTCACC 


24 


1 




22 


NIDFB 


ACCGGAGTACAGTTCCACTGTYTTRAARTCDATR 
TT 


36 


48 


anth 
sense 


23 


NIDFB5Q 


TGTCACCTTGACATGAGGCCA 


21 


1 




24 


FREYA 


TTTGACCTGGAGACTATGTTYMGNGARTAYAA 


32 


64 


sense 


25 


FREYB 


GCTCTGGGTGTAGTAGTTRTAYTCYCTRAACAT 


33 


16 


anti- 
sense 


26 


NVFDB 


TCTCGGAACATGCTCTCCAGRTCRAAMACRTT 


32 


32 


anti- 
sense 


27 


GGMA 


ACCTTCATCAAAAATCCCTTNGGNGGNATGYT 


32 


128 


sense 


28 


TVNCA 


TGGACTTACAGGACTCGAACNACNGTNAAYTG 


32 


128 


sense 


29 



The orientation indicated in Table 4 is relative to the encoding region of the polynucleotide. 
Oligomers with a 'sense' orientation will hybridize to the strand antisense to the coding strand and 
Initiate amplification in the direction of the coding sequence. Oligomers with an 'antisense' orientation 
will hybridize to the coding strand and initiate amplification in the direction opposite to the coding 
sequence. 

These oligonucleotides have t>een designed with several properties in mind; 1) sensitivity for 
target DNA even when present in the source material at very low copy numbers; 2) sufficient specificity 
to avoid hybridizing with unwanted sequences: for example, host sequences with limited similarity; 3) 

30 
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sufficient cross-reactivity so that differences between an unknown target and the sequence used to 
design it do not prevent the oligonucteotide from forming a stable duplex with the target. 

For some applications, a particularly effective design is oligonucleotides that have a degenerate 
segment at the 3' end, designed from a region of at least 2 known polynucleotides believed to be 
5 somewhat consen/ed with the polynucleotide target. The various permutations of the ambiguous 
residues help ensure that at least one of the alternative forms of the oligonucleotide will be able to 
hybridize with the target. Adjacent to the degenerate segment at the 5' end of the oligonucleotide is a 
consensus segment which strengthens any duplex which may fomn and permits hybridization or 
amplification reactions to be done at higher temperatures. The degenerate segment is located at the 3' 
10 end of the molecule to increase the likelihood of a close match between the oligonucleotide and the 
target at the site where elongation t>egin$ during a potymerase chain reaction. 

The ambiguous residues in the degenerate part of the sequences are indicated according to the 
following code: 



TABLE S: Single Letter Codes for Ambiguous Positions j 


Code 


Represents | 


R 


AorG (purine) 


Y 


C or T (pyrimidirw) 


W 


AorT 


S 


C or G 


M 


Aor C 


K 


GorT 


B 


CorGorT (not A) 


D 


A or GorT (not C) 


H 


A or C or T (not G) 


V 


A or C or G (not T) 


N 


A or 0 or G or T 



15 

The Type 1 oligonudeotides shown in Table 4 are generally useful for hytxidizir^ with 
Glycoprotein B encoding polynucleotide segments. This may be conducted to detect the presence of the 
polynucleotide, or to prime an amplification reaction so that the potynudeobde can be characterized 
further. Suitable targets irwiude polynucleotides encoding a region of a Glycoprotein B from a wide 

20 spectrum of gamma herpes viruses, including members of the RFHV/KSHV subfamily. Suitable are those 
infecting any vertebrate artimal, including humans arKl non-human primates, wtwther or not the 
Glycoprotein B or the virus has been previously known or described. Norvlimiting examples include 
polynucleotides encoding Glycoprotein B from any of the gamma herpes viruses listed in Table 1 . 

The oligonudeotides may be used, inter alia, to prime a reaction to amplify a region of the target 

25 potynudeotide in ttve 3' direction from the site where the oligonudeotide hybridizes. FRF OA, HIVPA. 
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TVNCB. FAYDA. rVGKA, CYSRA, NIDFB. FREYA. FREYB. NVFDB. GGMA. and TVNCA are 
digonudeotides with a consensus segment adjoining a degenerate segment, and are useful for tiiis 
purpose. 

Figure 2 shows the position along the Glycoprotein B polynucleotide sequence of the 
5 RFHV/KSHV subfamity where the aforementioned oligonucleotide primers are expected to hybridize. The 
map Is not drawn to scale, but accurately depicts the order of the predicted hybridization sites in the 5' to 3' 
direction along the Glycoprotein B encoding strand. Also indicated are approximate lengths of 
amplification products that may be generated by using various sets of primers In an amplification reaction. 
The lengths shown include the primer binding sites at each end, and the polynucleotide encompassed 
10 between them. 

A preferred source of DNA for use as a target for the oligonucleotides of Table 4 is any biological 
sample (including solid tissue and tissue cultures), particulariy of vertebrate animal origin, tcrv>wn or 
suspected to hartxsr a herpes virus. DNA is extracted from the source by any method known in the art, 
including extraction with organic solvents or precipitation at high salt corx^ntration. 

15 A preferred method of amplification is a polymerase chain reaction: see generally U.S. Patent No. 

4,683.195 (Mullts) and U.S. 4,683.202 (Mullis et al.}; see U.S. 5,176,995 (Sninsky et al.) for application to 
viral polynucleotides. An amplification reaction may be conducted by combining the target polynucleotide 
to be amplified with short oligonucleotides capable of hybridizing with the target and acting as a primer for 
the polymerization reaction. Also added are substrate mononucleotides and a heat-stable DNA-dependent 

20 Glycoprotein B. such as Taq. The conditions used for amplification reactions are generally known in the 
art. and can be optimized empirically using sources of known viruses, such RFHV. KSHV. hEBV or HSV1 . 
Conditions can be altered, for example, by changing the time and temperature of the amplification cycle, 
particulariy the hybridization phase; changing the molarity of the oligonucleotide primers: changing the 
buffer composition; and changing the number of amplification cycles. Fine-tunir^ the amplification 

25 conditions is a routine matter for a practitioner of ordinary skill in the art. 

In one method, a single primer of this invention is used in the amplification, optionally using a 
second primer, such as a random primer, to initiate replication downstream from the first primer and in the 
opposite direction. In a preferred method, at least two of the primers of this invention are used in the same 
reaction to initiate replication in opposite directions. The use of at least twvo specific primers enhances the 

30 specificity of the amplification reaction, and defines the size of the fragment for comparison between 
samples. For example, amplification may be performed using primers NIVPA and T^/NCB. More prefened 
is the use of several sets of primers in a nested fashion to enhance the amplification. Nesting is 
accomplished by performing a first amplification using pnmers that generate an intermediate product, 
comprising one or more internal binding sites for addrtior^i primers This is followed by a second 

35 amplification, using a new primer in conjunction with one from the previous set. or two new primers. The 
secorxl amplification product is therefore a subfragment of the first product. If desired, additional rounds of 
amplification can be performed using additiorul primers. 

Accordingly, resting amplification reactions can be performed with any combination of three or 
more oligonucleotide primers comprising at least one primer with a sense orientation and one primer with 
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an antisense orientation. Preferably, primers are chosen so that intennediate amplification products are no 
more than about 2000 base pairs; more preferably, they are no more than about 1500 base pairs; even 
more preferably, they are no more than about 750 base pairs. Preferably, the innemnost primers provide a 
final amplification product of no more than about 1200 base pairs; more preferably, they are no more than 
5 about 750 base pairs; even more preferably, they are no more than about 500 base pairs. Accordingly, a 
preferred combination is at least three primers selected from FAYDA, lYGKA. CYSRA, NIDFB. NVFDB. 
and FREYB. Another preferred combination is at least three primers selected from FRFDA. NIVPA. 
7VNCA. NIDFB. NVFDB. and FREYB. 

Particularty prefened is a first amplification using primer FRFDA and TVNCB. followed by a 
10 second amplification using primer NIVPA and TVNCB. When perfomied on a polynucleotide from a 
Glycoprotein B gene of KSHV. the size of the final fragment including the primer binding regions is about 
3B6 bases. 

The amplified polynucleotides can be characterized at any stage during the amplification reaction, 
for example, by size detemiination. Preferabty, this is perfixmed by running the polynucleotide on a gel of 

15 about 1-2% agarose. If present in sufficient quantity, the polynucleotide in the gel can be stained with 
ethidium bromide and detected under ultraviolet light. Altematrvety, the polynucleotide can be labeled with 
a radioisotope such as ^P or before loading on a gel of about 6% potyacryl amide, and the gel can 
subsequently be used to produce an autoradiogram. A preferred method of labeling the amplified 
polynucleotide is to end-label an oligonucleotide primer such as NIVPA with "P using a polynucleotide 

20 kinase and gamma-["P}-ATP, and continuing amplification for about 5-1 5 cydes. 

If desired, size separation may also be used as a step in the preparation of the amplified 
polynucleotide. This is particularty useful wt>en the amplification mixture is found to contain artifact 
polynucleotides of different size, such as may have arisen through cross-reactivity wrth undesired targets. 
A separating gel. such as described in the preceding paragraph, is dried onto a paper backing and used to 

25 pnxJuce an autoradiogram. Positions of the gel corresponding to the desired bands on the autoradiogram 
are cut out and extracted by standard techniques. The extiBCted polynucleotide can then be characterized 
directly, cloned, or used for a further round of amplification. 

The oligonucleotides NIVPASQ. TVNCBSQ. lYGKASQ. CYSRASQ. and NIDFBSQ are each 
derived from a consensus-degenerate Type 1 oligonucleotide. They retain the consensus segment, but 

30 lack the degenerate segment They are useful, inter alia, in sequencing of a Glycoprotein B polynucleotide 
fragment successfully amplified using a consensus-degenerate oligonucleotide. 

Unwanted potynudeotides in a mixture from an amplification reaction can also be proportionally 
reduced by shifting to primers of this type. For example, an initial 3-5 cydes of amplification can t>e 
conducted using primers NIVPA and TVNCB at 1/5 to 1/25 the normal amount. Then a molar excess (for 

35 example, 50 pmo!) of NIVPASQ and/or TVNCBSQ is added, and the amplification is continued for an 
additional 30-35 cydes. This reduces the complexity of tt>e digonudeotides present in the amplification 
mixtijre. and pemiits the reaction temperatures to be increased to reduce amplification of unwanted 
potynudeotides. 

33 



wo 97/12042 



PCT/US96/15702 



Type 2 oUgonucl&otide primers for Glycoprotein B of the RFHV/KSHV subfamily 

Type 2 oligonudeotides are intended for detection or amplification reactions for the Glycoprotein B 
of any virus of the RFHV/KSHV subfamily. They are designed from segments of the Glycoprotein B 
5 encoding region that are relatively well conserved between RFHV and KSHV, but not other previously 
sequenced gamma herpes vimses. Prefemed examples are shown in Table 6: 



TABUE 6: Type 2 Oligonucleotides used for Oetacttng, Amplifying, or Characterizing Herpes I 
Virus Polynucleotides encoding Glycoprotein B 


Target Gfyoopmtsin Binmihm FiFHV/KSHV subfamdy of herpes mBes 


Desfg- 
nation 


Sequence 

(S'to 3') 


Length 


No. of 
forms 


Orien- 
tition 


SEQ 

ID: 


SHMDA 


AGACCCGTGCCACTCTATGARATHAGYCAYA 
TGGA 


35 


24 


sense 


41 


SHMDASQ 


AGACCCGTGCCACTCTATGA 


20 


1 




42 


CFSSB 


GTTCAC/^ACAATCTTCATNGARCTRAARCA 


30 


32 


anth 
sanse 


43 


CFSSBSQ 


GTTCACAACAATC7TCAT 


18 


1 




44 


ENTFA 


GTCAACGGAGTAGARAAYACirrTYACNGA 


29 


128 


sense 


45 


DNIOB 


ACTGGCTGGCTAAAGTACCTTTGAATRTTRTC 
NGT 


35 


16 


anti- 
sense 


46 


ONIQBSQ 


ACTGGCTGGCTAAAGTACCTTTG 


23 


1 




47 



1 0 Type 2 oligonucleotides may be used for many purposes where specificity for the RFHV/KSHV 

subfamily specificity is desired. This includes the detection or ampltficatton of Glycoprotein B from 
known viruses of the RFHV/KSHV subfamily, or characterization of Glycoprotein B from new memtwrs 
of the family. 

SHMDA, CFSSB. EHTFA, an6 DNIQB are consensus-degenerate oligonucleotides with a 
15 degenerate 3' end, useful as initial primers for PGR amplifications, including polynucleotides of the 
RFHV/KSHV subfamily which are not identical to either RFHV or KSHV. SHMDASQ, CFSSBSQ. and 
DNIQBSQ contain only a consensus segment, and are useful for example in labeling or sequencing 
polynucleotides already amplified using the consensus-degenerate oligonucleotides. 

In one application, these Type 2 oligonucleotides are used individually or in combination as 
20 amplification primers. In one example of this application, the oligonucleotides are used directly on DMA 
obtained from a tissue sample to obtain a Glycoprotein B from the RFHV/KSHV subfamily, but not more 
distantly related viruses that may t>e present tn the same tissue, such as hEBV. hCMV or HSV1 . Thus. 
SHMDA ar>d DNIQB may be used as primers in a PGR. optionally preamptified using Type 1 
oligonucleotides such as NtVPA and TVNCB Other combinations are also suitable. In another 
25 example, one of the Type 2 oligonucleotides of Table 6 is used in combination with a suitable Type 1 

3^ 
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oligonudeotide listed earlier. Thus, NIVPA may be used in combination with DNIQB. or SHMDA may 
be used In combination with 7VNCB as primers in a PCR. The DNA source may optionally be 
preamplified using NIVPA and TV^CB. Other combinations are also suitable. 

In another application. Type 2 oligonucleotides, or oligonucleotides comprising these sequences 
or fragments thereof, are used as probes in a detection assay. For example, they can be provided with 
a suitable label such as ^^P, and then used in a hybndization assay with a suitable target, such as DNA 
amplified using FRFDA and/or NIVPA. along with TVNCB. 

Type 3 oitgonucteotide primers specific for Glycoprotein B of RFHV or KSHV 

Type 3 oligonucleotides are intended for detection or amptificatjon reactions specific for a 
particular virus. They are norwdegenerate segments of the Glycoprotein B encoding region of RFHV or 
KSHV that are relatively more variable between these two viruses and against other herpes viruses 
than are other segments of the region. Prefered examples are shown in Table 7. and in the Example 
section. 



3^ 
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TABLE 7: Type 3 OllgonucleoUdes used for Detecting. Amplifying, or Characterizing 
Herpes Virus Polynucleotides encoding Glycoprotein B 


Target Glycoprotein B from RFHV \ 


Desig- 
nation 


Sequence 

(5* to 3*) 


Length 


No. of 
forms 


Orfon-tation 


SEQ I 
ID: 


GMTEB 


TGCTGCTTCT6TCATACCGCG 


21 




anb-sense 


48 


AAITB 


TATTTGTTTGTGATTGCTGCT 


21 




anti-sense 


49 1 


GMTEA 


GCGGTATGACAGAAGCAGCAA 


21 




sense 


50 1 


KYEIA 


AACAAATATGAGATCCCCAGG 


21 




sense 


51 j 


TDRDB 


TCATCCCGATCGGTGAACGTA 


21 




anti-sense 


52 1 


VEGLB 


TTGTCAGTTAGACCTTCGACG 


21 




anti-sense 


53 1 


VEGLA 


CCCGTCGAAGGTCTAACTGAC 


21 




sense 


54 


PVLYA 


AGCCAACCAGTACTGTACTCT 


21 




sense 


55 


1 Target GlycoproMn B fnm KSHV 


1 DMig- 
1 nation 


Sequence 

(5- to 3*) 


Longth 


No. of 
forms 


Orien-tation 


SEQ 

ID: 


GLTEB 


TGATGGCGGACTCTGTCAAGC 


21 




antf- sense 


56 


TNKYB 


GTTCATACTTGTTGGTGATGG 


21 




anti-sense 


57 


GL7EA 


GGGCTTGACAGAGTCCGCCAT 


21 




sense 


56 


YELPA 


ACAAGTATGAACTCCCGAGAC 


21 




sense 


59 


VNVNB 


ACCCCGTTGACATTTACCTTC 


21 




anti-sense 


60 


TFTDV 


TCGTCTCTGTCAGTAAATGTG 


21 




anthsense 


61 


JVFIA 


CCACAGTATTCCTCCAACCAG 


21 




sense 


62 


SQPVA 


GGTACTTTAGCCAGCCGGTCA 


21 




sense 


63 



GMTEB. AAITB. GMTEA. KYEIA. TDRDB, VEGLB, VEGLA. and PVLYA are specific 
non-degenerate oligonucleotides for the RFHV Glycoprotein B, and can be used for the amplification or 
detection of Glycoprotein B encoding polynucleotides of RFHV origin Amptificatron is preferably done 
5 using the oligonucleotides in a nested fashion: e.g., a first amplification is conducted using GMTEA and 
VEGLB as primers: then a second amplification is conducted using KYEIA and TDRDB as pnmers. 
This provides an extremely sensitive amplification assay that is specific for RFHV Glycoprotein B, 
GMTEB arKj AAITB hybridize near the 5' end of the fragment, and may be used in combination wvith up- 
stream hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 5' direction. VEGLA 
10 and PVLYA hybridize near the 3' end of the fragment, and may t>e used in combination with dowv 
stream hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 3' direction. 

Similarly. GLTEB. TNKYB, GLTEA. YELPA, VNVNB. ENTFB, SQPVA. and TVFLA are specific 
norvdegenerate oligonucleotides for the KSHV Glycoprotein B, and can be used in a similar fashion, 
including as primers for an amplification reaction. Preferably, the amplification is done using the 
15 oligonucleotides in a nested fashion: e.g.. a first amplification is conducted using GLTEA and ENTFB 

3h 
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as primers; then a second amplification is conducted using YELPA and VNVNB as primers. This 
provides an extremely sensitive amplification assay that is specific for KSHV Glycoprotein B. GLTEB 
and TNKYB hybridize near the 5* end of the fragment, and may be used in combination with up-stream 
hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 5' direction. SQPVA and 
5 TVFLA hybridize near the 3' end of the fragment, and may be used in combination with down-stream 
hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 3' direction. 

Practitioners skilled in the an will immediately recognize that oligonucleotides of Types 1 . 2 and 
3 (in particular, those shown in Tables A, 6 and 7) can be used in combination with each other in a PGR 
to amplify different sections of a Glycoprotein B encoding polynucleotide. The specificity of the 
10 amplification reaction generally is determined by the primer with the least amount of cross reactivity. 
The size and location of the amplified fragment is determined by the primers used in the final round of 
amplification. For example. NIVPA used in combination with SQPVB will amplify about 31 0 bases of 
Glycoprotein B encoding polynucleotide from a virus closely related to KSHfV. Suitable combinations of 
oligonucleotides may be used as amplification primers in a nested fashion. 

15 

Use of synthetic oligonucleotides to characterize polynueieotide targets 

As described in the previous section, the oligonucleotides emt>odied in this invention, can be 
used as primers for amplification of polynucleotides encoding a herpes virus Glycoprotein B, particutariy 

20 in a polymerase chain reaction. 

The conditions for conducting the PGR depend on the nature of the oligonucleotide being used. 
In particular, when using oligonucleotides comprising a degenerate segment or a consensus segment 
that is only partly identical to the corresponding segment of the target, and when the target 
polynucleotide comprises an unknown sequence, the selection of conditions may be important to the 

25 success of the amplification. Optimizing conditions for a new primer or new polynucleotide target are 
routine for a practitioner of ordinary skill. What follows is a guide to assist in that objective. 

First, the temperature of the annealing step of the PGR is optimized to increase the amount of 
target polynucleotide being amplified above the amount of unrelated polynucleotide amplified. Ideally, 
the temperature permits the primers to hybridize with the target sequence but not with other sequences. 

30 For primers comprising a consensus segment (Type 1), the temperature of the annealing step in repeat 
cycles of a PGR is generally at least about 45'G: preferably it is at least about 50*G. tt is also preferable 
to conduct the first few cycles of the PGR at even higher temperatures, such as 55'C or even 60'C. 
The higher temperature wilt compel the annealing to t>e more sequence specific during the cycle and 
will reduce the background amplification of unrelated sequences. Annealing steps for subsequent 

35 cycles may be performed under slightly less stringent conditions to improve the rate of amplification. In 
an especially preferred procedure, the first PGR amplification cycle comprises an annealing step of 
atwut 1 min conducted at 60'C. Annealing steps in subsequent cycles are conducted at 2*G lower each 
cycle, until a temperature of 50*C is reached. Further cycles are then conducted with annealing steps 
at 50*G, until the desired degree of amplification is achieved. 



37 
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Primers which are virus-specific and do not contain a consensus segment (Type 3) are more 
selective, and may fc>e effective over a broader temperature range. Preferred temperatures for the 
annealing step In PGR amplification cycles are between 50*C and 65*C. 

Second, the buffer conditions are optimized. We have found that buffers supplied with 
5 commercial preparations of Tag polymerase are sometimes difficult to use, in part because of a critical 
dependence on the concentration of magnesium Ion. PCRs performed using the oligonucleotides of 
this invention generally are more easily performed using a buffer such as that suggested by M. Wigler 
(Lisitsyn et al.). Preferably, the final PGR reaction mixture contains (NH4)2504 instead of KGl as the 
principal ion source. Preferably, the concentration of (NH4)2S04 in the final reaction mixture is about 5- 

10 50 mM. more preferably about 10-30 mM, even more preferably 16 mM. The tjuffering component is 
preferably Tris, preferably at a final concentration of about 67 mM and a pH of about 8.8. Under these 
conditions, the MgClj concentration is less critical. Preferably the final concentration is about 1-10 mM, 
more preferably it is about 3-6 mM, optimally it is about 4 mM. The reaction mixture may also contain 
about 10 mM d-mercaptoethanol and O.OS-1 mg/mL bovine serum albumin. An especially prefen^ed 

15 buffer is WB4 buffer (67 mM Tris buffer pH 8.8. 4 mM MgCla. 16 mM (NH4)2S04. 10 mM fi- 
mercaptoethanol and 0.1 mg/mL albumin. Preferred conditions for performing the reaction are provided 
below in Example 3. 

To conduct the PGR reaction, a mixture comprising the oligonucleotide primers, the four 
deoxynudeotides, a suitable buffer, the DNA to be amplified, and a heat-stable DNA-dependent ONA 

20 polymerase is prepared. The mixture is then processed through temperature cycles for the annealing, 
elongating, and melting steps until the desired degree of amplrficatton is achieved. The amount of DNA 
produced can be determined, for example, by staining with ethidium bromide, optionally after separating 
amplified fragments on an agarose gel. 

A possible complication of the amplification reaction is dimerization and amplification of the 

25 oligonucleotide primers themselves. This can be easily detected as low molecular weight (<100 base 
pair) fragments on an agarose gel. Amplified primer can be removed by agarose or polyacrylamide gel 
separation. The amount of amplified dimer may be reduced by minor adjustments to the conditions of 
the reaction, particulariy the temperature of the annealing step. It is also preferable to pre-mix the 
primers, the deoxynudeotides, and the buffer, and heat the mixture to 80 degrees before adding the 

30 DtMA to be amplified. 

Amplification reactions using any the oligonucleotides of this invention as primers yield 
polynucleotide fragments encoding a portion of a Glycoprotein B These fragments can be 
characterized by a number of techniques known to a practitioner of ordinary skill in the art. Some 
non-limiting methods for characterizing a fragment are as follows; 

35 In one method, a fragment may be sequenced according to any method of sequence 

determination known in the art. including the Maxam & Gilbert method, or the Sanger & Nicholson 
method. Alternatively, the fragment may be submitted to any of the commercial organizations that 
provide a polynucleotide sequencing service. The fragment may optionally be cloned and/or amplified 
before sequencing The nucleotide sequence can t>e used to predict the amino acid sequence encoded 
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by the fragment. Sequence data can be used for comparison with other sequenced Glycoprotein B's. 
either at the polynucleotide level or the amino add level, to identify the species of herpes virus present 
in the original source material. Sequence data can also be used in modeling algorithms to predict 
antigenic regions or three-dimensional structure. 
5 In a second method of characterizing, the size of the fragment can be determined by any suitable 

method, such as running on a potyacrylamide or agarose gel. or centrifliging through an appropriate 
density gradient. For example, for RFHV and KSHV. the fragment twtween NIVPA and TVNCB is 
about 319 bases. Hence, the length of the entire amplified fragment including primer binding regions is 
at}out 386 bases. The corresponding fragment of sHV1 contains an additional 6 base pairs. The sHV1 

10 fragment can therefore be distinguished from that of RFHV or KSHV. for example, by running amplified 
polynucleotide fragments from each in neighboring lanes of a separating gel, or by ojnning the sHVI 
fragment t>estde suitable molecular weight standards. Polynucleotide fragments identical in size to that 
of RFHV and KSHV may be from the same or a related viral species. Fragments substantially different 
in size are more likely to be derived from a different herpes virus. 

15 In a third method of characterizing, a fragment can be tested by attempting to hybridize it with an 

oligonucleotide probe. In a preferred example, a fragment is tested for relatedness to the Glycoprotein 
B encoding region of RFHV or KSHV. The test is conducted using a probe comprising a sequence of a 
Glycoprotein B encoding region, or its genetic complement. Suitable probes are polynucleotides 
comprising sequences from RFHV or KSHV, such as the Type 3 oligonucleotides listed in Table 7. 

20 The length and nature of the prot>e and the hybridization conditions are selected depending on 

the objectives of the test. If the objective is to detect only polynucleotides from RFHV or KSHV. 
including minor strain variants, then hybridization is performed under conditions of high stringency. A 
sequence from the respective Glycoprotein B is used. Longer length sequences improve the specificity 
of the test and can be used under conditions of higher stringency. Preferably, the probe will comprise a 

25 Glycoprotein B sequence of at least about 30 nucleotides; more preferably, the sequence will be at 
least about 50 nucleotides; even more preferably, the sequence will be at least atMut 75 nucleotides in 
length. 

If the objective is to detect polynucleotides that are closely related but not identical to RFHV or 
KSHV. such as in a screening test or a test to recruit previously undescribed viruses of the RFHV/KSHV 

30 subfamily, then different conditions are chosen. Sequences from RFHV or KSHV may t>e used, but a 
mixture of the two or a degenerate probe is generally preferred. The length of the sequence and the 
conditions of the hybridization reaction are selected to provide sufficient specificity to exclude unwanted 
sequences, but otherwise provide a maximum degree of cross-reactivity amongst potential targets. 
Suitable conditions can be predicted using the formulas given eariier, by calculating the T^ and then 

35 calculating the corresponding temperature for the maximum degree of mismatch to be tolerated. The 
suitability of the conditions can be tested empirically by testing the cross-reactivity of the probes with 
samples containing known target polynucleotides encoding herpes Glycoprotein 8. 

The minimum degree of complementarity required for a stable duplex to form under the 
conditions of the assay will determine what Glycoprotein 6 sequences will hybridize with the probe. 
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Consider, for example, a target obtained from a human or non-human primate, amplified to produce a 
fragment corresponding to bases 36-354 of SEQ. ID N0:3, and then probed with the coaesponding 
fragment of the KSHV polynucleotide. According to the data in Table 2, if the hybridization reaction is 
performed under conditions that require only about 50% Identity for a stable duplex to form, the probe 
5 may hybridize with targets from any of tfie sequenced gamma herpes Glycoprotein B genes, including 
hEBV and sHVl . If the reaction is performed under conditions that require at least about 65% identity 
between probe and target, preferably at least about 67% identity, more preferably at least about 70% 
identity, and even more preferably at least about 75% identity for a stable duplex to form, the assay will 
detect a target polynucleotide from the RFHV/KSHV subfamily: i.e., either RHFV, KSHV, or a closely 
10 related herpes virus with a Glycoprotein B polynucleotide not yel sequenced. Even under hybridization 
conditions that required only about 50-55% identity for a stable duplex to form, a positive reaction would 
not Indicate the presence of bHV4. eHV2, or mHV68, since these viruses are not believed to be capable 
of infecting primates. 

It is possible to combine characterization by size and characterization by hybridization. For 
15 example, the amplified polynucleotide may be separated on a get of acrylamide or agarose, t>lotted to a 
membrane of suitable material, such as nitrocellulose, and then hybridized with a prot>e with a suitable 
lat>el, such as "P. The presence of the label after washing reflects the presence of hybridizable 
material in the sample, while the migration distance compared with appropriate molecular weight 
standards reflects the size of the material. A fragment sequence hybridizing with one of the 
20 aforementioned probes under conditions of high stringency but having an unexpected size would 
indicate a Glycoprotein B sequence with a high degree of identity to the probe, but distinct from either 
RFHV or KSHV. 



Use of polynuclBotldes and oligonucleotides to detect herpes virus infection 

25 

Polynucleotides encoding herpes virus Glycoprotein B, and synthetic oligonucleotides based 
thereupon, as embodied in this invention, are useful in the diagnosis of clinical conditions associated 
with herpes vims infection. For example, the presence of detectable herpes Glycoprotein B in a clinical 
sample may suggest that the respective herpes virus participated as an etiologic agent in the 

30 development of the condition. The presence of viral Glycoprotein B in a particular tissue, but not in 
surrounding tissue, may be useful in the localization of an infected lesion. Differentiating between 
gamma herpes virus and other herpes viruses in clinical samples may be useful in predicting the clinical 
course of an infection or selecting a drug suitable for treatment. Since Glycoprotein B is expressed by 
replicative virus. L-particles, and infected cells, we predict that it will serwe as a useful maricer for active 

35 and quiescent stages of the disease that involve expression of the protein in any of these forms. 

The procedures for conducting diagnostic tests are extensively known in the art, and are routine 
for a practitioner of ordinary skill. Generally, to perform a diagnostic method of this invention, one of the 
compositions of this invention is provided as a reagent to detect a target in a clinical sample with which 
it reacts For example, a polynucleotide of this invention may be used as a reagent to detect a DNA or 

40 RfMA target, such as might be present in a cell infected with a herpes virus. A polypeptide of this 
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invention may be used as a reagent to detect a target with which it is capakHe of forming a specific 
complex, such as an antibody molecule or (if the polypeptide is a receptor) the con-esponding tigand. 
An antibody of this invention may be used as a reagent to detect a target it specifically recognizes, such 
as a polypeptide expressed by viratly Infected celts. 
5 The target is supplied by obtaining a suitable tissue sample from an individual for whom the 

diagnostic parameter is to be measured. Relevant test samples are those obtained from individuals 
suspected of hartwring a herpes virus. Many types of samples are suitable for this purpose, including 
those that are obtained near the suspected site of infection or pathology by biopsy or surgical 
dissection, in vitro cultures of cells derived therefrom, solubilized extracts, blood, and t>lood 

1 0 components. If desired, the target may be partially purified from the sample or amplified before the 
assay is conducted. The reaction is performed by contactir>g the reagent with the sample under 
conditions that will allow a complex to form between the reagent and the target. The reaction may be 
performed in solution, or on a solid tissue sample, for example, using histology sections. The formation 
of the complex is detected by a number of techniques known in the art. For example, the reagent may 

15 t>e supplied with a label and unreacted reagent may be removed from the complex: the amount of 
remaining label thereby Indicating the amount of complex formed. Further details and altematives for 
complex detection are provided in the descriptions that follow. 

To determine whether the amount of complex formed is representative of herpes infected or 
uninfected cells, the assay result is preferably compared with a similar assay conducted on a control 

20 sample. It is generally preferable to use a control sample which is from an uninfected source, and 
otherwise similar in composition to the clinical sample being tested However, any control sample may 
be suitable provided the relative amount of target in the control is known or can be used for comparative 
purposes. It is often preferable to conduct the assay on the test sample and the control sample 
simultaneously. However, if the amount of complex formed is quantifiable and sufficiently consistent, it 

25 is acceptable to assay the test sample and control sample on different days or in different laboratories. 

Accordingly, polynucleotides encoding Glycoprotein B of the RFHV/KSHV subfamily, and the 
synthetic oligonucleotides embodied in this invention, can be used to detect gamma herpes virus 
polynucleotide that may be present in a biological sample. General methods for using polynucleotides 
in specific diagnostic assays are well known in the art: see. e.g.. Patent Application JP 5309000 

30 (latron). 

An assay employing a polynucleotide reagent may be rendered specific, for example: 1) by 
performing a hybridization reaction with a speafic probe: 2) by performing an amplification with a 
specific primer, or 3) by a combination of the two. 

To perform an assay that is specific due to hybridization with a spedfic probe, a polynucleotide is 
35 chosen with the required degree of complementarity for the intended target. Preferred probes include 
polynucleotides of at least about 16 nucleotides in length encoding a portion of the Glycoprotein B of 
RFHV. KSHV, or another member of the RFHV/KSHV subfamily. Increasingly preferred are probes 
comprising at least about 18, 21, 25. 30. 50. or 100 nucleotides of the Glycoprotein B encoding region. 



wo 97/12042 



PCT/US96/15702 



Also preferred are degenerate probes capable of fonning stable duplexes with polynucleotides of the 
RFI-fV/KSHV subfamily under the conditions used, but not polynucleotides of other herpes viruses. 

The probe is generally provided with a latwi. Some of the labels often used in this type of assay 
include radioisotopes such as and chemiluminescent or fluorescent reagents such as 
5 fluorescein, and enzymes such as alkaline phosphatase that are capable of producing a colored solute 
or precipitant. The label may be intrinsic to the reagent, it may be attached by direct chemical linitage. 
or it may be connected through a series of intermediate reactive molecules, such as a biotrn-avidtn 
complex, or a series of inter-reactive polynucleotides. The latjel may be added to the reagent before 
hybridization with the target polynucleotide, or afterwards. To improve the sensitivity of the assay, it is 
1 0 often desiratMe to increase the signal ensuing from hybridization. This can be accomplished by using a 
combination of serially hybridizing polynucleotides or branched polynucleotides in such a way that 
multiple label components twcome Incorporated into each complex. See U.S. Patent No. 5.124.246 
(Urdea et al.). 

If desired, the target polynucleotide may be extracted from the sample, and may also be partially 
15 purified. To measure viral particles, the preparation is preferably enriched for DNA; to measure active 
transcription of Glycoprotein B, the preparation is preferably enriched for RNA. Generally, it is 
anticipated that the level of polynucleotide of a herpes virus will be low in clinical samples: there may 
t>e just a few copies of OUA encodir>g the Glycoprotein B per celt wttere the virus is latent, or up to 
several hundred copies of DNA per cell where the virus is replicating. The level of mRNA will be higher 
20 in cells where the protein is actively expressed than those where the gene is inactive. It may therefore 
be desirable to enhance the level of target tn the sample by amplifying the DNA or RNA. A suitable 
method of amplification is a PCR, which is preferably conducted using one or more of the 
oligonucleotide primers emt}odied in this invention. RNA may be amplified by making a cDNA copy 
using a reverse transcriptase, and then conducting a PCR using the aforementioned primers. 
25 The target polynucleotide can t>e optionally subjected to any combination of additional 

treatments, including digestion with restriction endonucleases. size separation, for example by 
electrophoresis in agarose or polyacrylamide, and affixation to a reaction matrix, such as a blotting 
material. 

Hybridization is allowed to occur by mixing the reagent polynucleotide with a sample suspected 
30 of containing a target polynucleotide under appropriate reaction conditions. This may toe followed by 
washing or separation to remove unreacted reagent. Generally, both the target polynucleotide and the 
reagent must t>e at least partly equilibrated into tiie sir>gle-stranded form in order for complementary 
sequences to hybridize efficiently. Thus, it may be useful (particulariy in tests for DNA) to prepare the 
sample by standard denaturation techniques known in the art. 
35 The level of stringency chosen for the hybridization conditions depends on the objective of the 

test. If it is desired that the test be specific for RFHV or KSHV. then a probe comprising a segment of 
the respective Glycoprotein B is used, and the reaction is conducted under conditions of high 
stringency. For example, a prefen-ed set of conditions for use with a preferred probe of 50 nucleotides 
or more is 6 x SSC at 37'C in 50% formamide, followed by a wash at low ionic strength. This will 
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generally require the target to be at least about 90% identical with the polynucleotide probe for a stable 
duplex to form. The specificity of the reaction for the particular vinjs in question can also be increased 
by increasing the length of the probe used. Thus, longer probes are particulariy preferred for this 
application of the invention. Alternatively, if it is desired that the test also be able to detect other herpes 
5 viruses related to KSHV. then a lower stringency Is used. Suitable probes include fragments from the 
KSHV Glycoprotein B polynucleotide, a mixture thereof, or oligonucleotides such as those listed in 
Table 7. 

Appropriate hybridization conditions are determined to permit hybridization of the probe only to 
Glycoprotein B sequences that have the desired degree of identity with the probe. The stringency 

1 0 required depends on the length of the polynucleotide probe, and the degree of identity between the 
probe and the desired target sequence. Consider, for example, a probe consisting of the KSHV 
polynucleotide fragment between the hybridization sites of NIVPA and TVNCB. Conditions requiring a 
minimum identity of 60% would result in a stable duplex formed with a corresponding potynucieotide of 
KSHV and other gamma herpes viruses such as sHV1 ; conditions requiring a minimum identity of 90% 

15 would result in a stable duplex forming only with a polynucleotide from KSHV and closely related 
variants. Conditions of intennediate stringency requiring a minimum identity of 65-70% would permit 
duplexes to form with a Glycoprotein B polynucleotide of KSHV, and some other memtwrs of the 
RFHV/KSHV subfamily, but not with corresponding polynucleotides of other known herpes viruses, 
including gamma herpes vinjses eHV2, sHV1. mHV68. bHV4. EBV, and other human pathogens such 

20 as hCMV. hHV6. hVZV. and HSV1 . 

Conditions can be estimated beforehand using the formula given eariier. Preferably, the exact 
conditions are confimied by testing the probe with separate samples known to contain potynucteotides, 
both those desired to be detected and those desired to go undetected in the assay. Such samples may 
be provided either by synthesizing the pwlynucleotides from published sequences, or by extracting and 

25 amplifying DNA from tissues believed to be infected with the respective herpes vinjs. Detemnining 
hybridization conditions is a matter of routine adjustment for a practitioner of ordinary skill, and does not 
require undue experimentation. Since eHV2. sHVI, mHV68. bHV4 and EBV are more closely identical 
to the RFHV/KSHV subfamily than alpha and beta herpes vimses. conditions that exclude gamma 
herpes vinjses outside the RFHV/KSHV subfamily will generally also exclude the other herpes vinjses 

30 listed in Table 1 . In addition, if it is believed that certain vinises will not be present in the sample to be 
tested in the ultimate determination (such as eHV2. mHV68 or bHV4 in a human tissue sample), then 
the corresponding target sequences may optionally be omitted when working out the conditions of the 
assay. Thus, conditions can be determined that would pemiit Type 2 oligonucleotide probes such as 
those listed in Table 6 to form a stable duplex boXh with polypeptides comprising SEQ. ID N0:1 or SEQ. 

35 ID N0:3. but not a sequence selected from the group consisting of SEQ. ID NO:5-13. Conditions can 
also be determined that would permit a suitable fragment comprisir>g at least 21 or more consecutive 
bases of SEQ. ID NO:1 or SEQ. ID N0:3 to form a stable duplex both with a polynucleotide comprising 
SEQ. ID NO:1 and SEQ. ID NO:3. but not a polynucleotide comprising any one of SEQ. ID NO:5-13. 
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Alternatively, to conduct an assay that is specific due to ampfification with a specific primer: 
DtiA or RNA is prepared from the biological sample as before. Optionaliy, the target polynudeotrde is 
pre-amp(ifted in a PGR using primers which are not species specific, such as those listed in Table 4 or 
6. The target is then amplified using specific primers, such as those listed in Table 7, or a combination 
5 of primers from Table 4, 6, and 7. In a preferred emtsodiment, two rounds of amplification are 
performed, using oligonucleotide primers in a nested fashion: virus-specific or non-specific In the first 
round: virus-specific In the second round. This provides an assay which is both sensitive and specific. 

Use of a specific Type 3 primer during amplification is sufficient to provide the required 
specificity. A positive test may be indicated by the presence of sufficient reaction product at the end of 

1 0 the amplification series. Amplified polynucleotide can be detected, for example, by blotting the reaction 
mixture onto a medium such as nitrocellulose and staining with ethidium bromide. Altematively, a 
radiolat>eled substrate may t>e added to the mixture during a final amplification cycle: the incorporated 
tat>el may t>e separated from unincorporated label (e.g., by blotting or by size separation), and the label 
may be detected (e.g. by counting or by autoradiography). If ain on a gel of agarose or polyacrylamide. 

1 5 the size of the product may help confirm the identity of the amplified fragment. Specific amplification 
can also be followed by specific hybridization, by using the amplification mixture obtained from the 
foregoing procedure as a target source for the hybridization reaction outlined eariier. 

Use of polynuciootides for gene thempy 

20 

Embodied in this invention are pharmaceuticals comprising virus-specific polynucleotides, 
polypeptides, or antifc>odies as an active ingredient. Such compositions may decrease the pathology of 
the virus or infected cells on their own, or render the virus or infected cells more susceptible to 
treatment by non-specific pharmaceutical compounds. 

25 Polynucleotides of this invention encoding pari of a herpes virus Glycoprotein 6 may be used, for 

example, for administration to an infected individual for purposes of gene therapy (see generally U.S. 
Patent No. 5,399,346: Anderson et al.). The general principle is to administer the polynucleotide in 
such a way that it ether promotes or attenuates the expression of the polypeptide encoded therein. 

A preferred mode of gene therapy is to provide the polynucleotide in such a way that it will be 

30 replicated inside the cell, enhancing and prolonging the effect. Thus, the polynucleotide is operatively 
linked to a suitable promoter, such as the natural promoter of the corresponding gene, a heterologous 
promoter that is intrinsically active in cells of the target tissue type, or a heterologous promoter that can 
be induced by a suitable agent. Entry of the polynucleotide into the cell is facilitated by suitable 
techniques known in the art. such as providing the polynucleotide in the form of a suitat>le vector, such 

35 as a viral expression vector, or encapsulation of the polynucleotide in a liposome. The polynucleotide 
may t>e injected systemically. or provided to the site of infection by an antigen-specific homing 
mechanism, or by direct injection. 

In one variation, the polynucleotide comprises a promoter linked to the polynucleotide strand with 
the same orientation as the strand that is transcribed during the course of a herpes virus infection. 

40 Preferably, the Glycoprotein B that is encoded includes an extemal component, a transmembrane 

Hi 
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component, and signal sequences for transport to the surface. Viralty infected cells transfected with 
polynucleotides of this kind are expected to express an enhanced level of Glycoprotein B at the surface. 

Enhancing Glycoprotein B expression in this fashion may enhance recognition of these cells by 
elements of the immune system, including antibody (and antibody-dependent effectors such as ADCC). 
5 and virus-specific cytotoxic T cells. 

tn another variation, the polynucleotide comprises a promoter finked to the polynucleotide strand 
with the opposite orientation as the strand that is transcribed during the course of a herpes vinjs 
infection. Virally infected cells transfected with polynucleotides of this kind are expected to express a 
decreased level of Glycoprotein B. The transcript is expected to hybridize with the complementary 
10 strand transcribed by the viral gene, and prevent it from being translated. This approach is known as 
anti-sense therapy. 

RFHV/KSHV subfamity pofypoptides with Glycoprotein B activity and fragments thereof 

15 The RFHV and KSHV polynucleotide sequences shown in Figure 1 have open reading frames. 

The polypeptide encoded thereby are shown in SEQ. ID N0;2 and SEQ. ID N0:4. respectively. 
Encoded between the hybridizing regions of the primers NIVPA and TVNCB used to obtain the 
polynucleotide sequence is a 106 amino acid fragment of the Glycoprotein B molecule which is 91% 
identical between RFHV and KSHV. The full protein sequence of KSHV Glycoprotein B is shown in 

20 SEQ. ID NO; 94. A Glycoprotein B fragment of a third member of the RFHV/KSHV subfamily. RFHV2. is 
shown in SEQ. ID NO:97. 

There are a number of homologous residues to Glycoprotein B molecules of other sequenced 
herpes viruses. The longest sequence contained in SEQ. ID N0:2 or SEQ. ID NO:4 but not in the 
known sequences of other herpes vinjses is 9 amino acids in length, with two exceptions (SEQ. ID 

25 NOS:64 and 65). Longer matching portions are found elsewhere in the Glycoprotein B amino acid 
sequence. The longest is the 21 amino add sequence from bHV4 shown in SEQ. ID NO;99; the rest 
are all 16 amino acids long or less. Other than SEQ. ID NO:99 exception, any fragment of the RFHV 
and KSHV Glycoprotein B protein sequence that is 1 7 amino acids or longer is believed to be specific 
for RFHV or KSHV. respectively, or to closely related strains. Since bHV4 and the other viruses with 

30 matching segments are not believed to be capable of infecting primates, any fragment of about 10 
amino acids or more found in a primate that was contained in SEQ ID NO:4 would indicate the 
presence of an infectious agent closely related to KSHV. 

This invention emt)Odies tx)th intact Glycoprotein B from herpes viruses of the RFHV/KSHV 
subfamity, and any fragment thereof that is specific for the subfamily Preferred Glycoprotein B 

35 fragments of this invention are at least 10 amino acids in length: more preferably they are at least 13 
amino adds in length; more preferably they are at (east 17 amino adds in length: more preferably they 
are at least about 20 amino adds in length: even more preferably they are at least about 25 amino 
acids in length, still more preferably they are at least about 30 amino acids in length. 

The amino add sequence of the RFHV and KSHV Glycoprotein B fragment shown in SEQ. ID 

40 NOS:2. 4, 94 and 96 can be used to identify vinjs-spedftc and cross-reactive antigenic regions. 
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In principle, a specific antibody could recognize any amino add difference between sequences 
that is not also shared by the species fronr> which the antitjody is derived. Antibody binding sites are 
generally big enough to encompass 5-9 amino acid residues of an antigen, and are quite capable of 
recogniring a single amino acid difference. Specific antibodies may be part of a polyclonal response 
5 arising spontaneously in animals infected with a virus expressing the Glycoprotein B. Specific 
antibodies may also be induced by injecting an experimental animal with either the intact Glycoprotein 
B or a Glycoprotein B fragment. 

Thus, any peptide of 5 amirw adds or more that is unique to KSMV is a potential virus-specific 
antigen, and could be recognized t»y a KShtV-spedfic antibody. Simitariy. any peptide of sufficient 
10 length shared within the RFHV/KSHV subfamily but not with other herpes viruses is a potential 
subfamily-spedfic antigen. 

Some examples of prefen-ed peptides are shown in Table 8. Practitior>ers in the art will 
immediately recognize that other peptides with simitar spectficittes may be designed by minor 
alterations to the length of the peptides listed and/or moving the frame of the peptide a few residues in 
15 either direction. 

The Class I peptides shown In Table 8 are conserved between Glycoprotein B of KSHV and that 
of certain other members of the gamma herpes virus subfamily. An antibody directed against one such 
Glycoprotein B in this region may therefore cross-react with some of the others. Class II peptides are 
conserved between Glycoprotein B of RFHV and KSHV. but not with other gamma herpes viruses. An 

20 antibody directed against this region is expected to cross-react between RFHV, KSHV, and other 
viruses of the RFHV/KSHV sutjfamily; but not with herpes viruses outside the subfamily. Class III 
peptides are different twtween Glycoprotein B of RFHV. KSHV. and other known gamma herpes 
viruses. An antitxxly binding to this region, particularly to non-identical residues contained therein, is 
expected to distinguish RFHV and KSHV Glycoprotein B from each other, and from Glycoprotein B of 

25 more distantly related herpes viruses. 
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TABLES: Antigen Peptides 




SpKificity 


Sequence 


Length 


SEa ID 

MO* 


Class t: 


Shared with 










t>HV4 


YKIVlATovF VTKts 








t)HV4,mHV68 


RYrSQP 


ft 


DO 


Shared amongst RFHV/KSHV 
subfamily and some other 
gamma herpes viruses 


bHV4 


lYAEPGWFPGIYRVR 
lYAEPGWFPGIYRVRTTVNCE 


15 
21 


65 
99 




mHVSB 


VLEELSRAWCREQVRD 


16 


100 


Class 11: 




VTVYRG 


6 


67 






AfTNKYE 


7 


68 


Shared amongst RFHV/KSHV 

Am f ■ ■! ■ *l 1 

suDtanTBiy 




SHMDSTY 


7 


69 






VENTFTD 


7 


70 






7VFLQPV 


7 


71 






TDNIQRY 


7 


72 


Class 111: 


Specific for 










RFHV 


RGMTEAA 


7 


73 


Vtais specific^ 


KSHV 


RGLTESA 


7 


75 




RFHV 


PVLYSEP 


7 


74 




KSHV 


PVIYAEP 


7 


76 


^ • Not shared with any other sequenced herpes virus; may be 
present in some unseqoenced RFHV/KSHV subfamily vin»es 



Particularly preferred peptides from Class III are those encompassing regions of Glycoprotein B 
with the polarity characteristics appropriate for an antigen epitope, as described in the Example section. 
5 Given the complete sequence of a Glycoprotein B from KSHV and other members of the RFHV/KSHV 
subfamily, virus- or subfamily-specific peptides can be predicted for other regions of the molecule by a 
similar analysis. 

Proparation of polypeptides 

10 

Polypeptides of this invention may be prepared by several different methods, all of which will be 
known to a practitioner of ordinary skill. 

For example, short polypeptides of about 5-50 amino adds in length are conveniently prepared 
from sequence data by chemical synthesis. A preferred method is the solid-phase Menifield technique. 
15 Alternatively, a messenger RNA encoding the desired polypeptide may be isolated or synthesized 
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according to one of the methods de5crit>ed earlier, and translated using an in vrtro translation system, 
such as the rabt^it reticulocyte system. See, e.g., Dorsky et at. 

Longer polypeptides, up to and including the entire Glycoprotein B, are conveniently prepared 
using a suitable expression system. For example, the encoding strand of a full-length cD^4A can be 
5 operatively linked to a suitable promoter, inserted into an expression vector, and transfected into a 
suitable host cell. The host celt is then cultured under conditions that allow transcription and translation 
to occur, and the polypeptide is subsequently recovered. For examples of the expression and recovery 
of Glycoprotein B from other species of herpes virus, see, for example, U.S. Patent Nos. 4,642,333 
(Person); 5,244,792 (Burke et al.); Manservigi et al. 

10 For many purposes, it is particularly convenient to use a recombinant Glycoprotein B 

polynucleotide that includes the regions encoding signals for transport to the cell surface, but lacks the 
region encoding the transmembrane domain of the protein. The polynucleotide may be tmncated 5' to 
the transmembrar>e encoding region, or it may comprise both extracellular and cytoptasmic encoding 
region but lack the transmembrar>e region. Constructs of this nature are expected to be secreted from 

15 the cell in a soluble form. Where It is desirable to have a Glycoprotein B fragment that is a monomer, 
the recombinant may be designed to limit translation to about the first 475 amino adds of the protein. 

For example, to express any of these forms of Glycoprotein B in yeast, a cassette may be 
prepared using the glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) promoter region and 
terminator region, GAPDH gene fragments are identified in a yeast library, isolated and I i gated in the 

20 appropriate configuration. The cassette is cloned into pBR322, isolated and confirmed by DNA 
sequencing. A pCl/1 plasmid is constructed containing a Glycoprotein B insert and GAPDH promoter 
and terminator regions. The plasmid is used to transform yeast strain S. cerevisiae. After culture, the 
yeast celts are pelleted by centrifugation, resuspended in a buffer containing protease inhibitors such as 
1 mM phenylmethylsulfonyl fluoride and 0.1 ng/m\ pepstatin. The washed cells are disrupted by 

25 voriexing with glass t>eads and recentrifuged. The presence in the supernatant of a Glycoprotein 6 of 
the correct size may be confirmed, for example, by Western blot using an antibody against Glycoprotein 
B, prepared as described in a following section Glycoprotein B may be purified from the supernatant 
by a combination of standard protein chemistry techniques, including ion exchange chromatography, 
affinity chromatography using antibody or substrate, and high-pressure liquid chromatography. 

30 To express Glycoprotein B in mammalian cells, for example, a mammalian expression vector 

such as pSVI/dhfr may be used. This has an ampictllin-resistance beta-lactamase gene, and a 
selectable mammalian cell marker, dihydrofotate reductase linked to the SV40 eariy promoter. 
Glycoprotein B polynucleotide blunt-end fragments are ligated into the pSVI/dhfr vector and digested 
with endonucleases to provide a cassette including the SV40 promoter, the Glycoprotein encoding 

35 region, and the SV40 splice and polyadenytation sites. The ptasmids are used, for example, to 
transform CHO celts deficient in dhfr, and transfectants are selected. Celts expressing Glycoprotein B 
may be identified, for example, by immunofluorescence using anti-Glycoprotein 6 as the primary 
antibody. 
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In another example, recombinant plasmids for expressing Glycoprotein B are cloned under 
control of the Rous sarcoma virus long terminal repeat in the episomal replicating vector pRP-RSV. 
This plasmid contains the origin of replication and early region of the human papovavirus BK, as well as 
the dhfr resistance marker. The vector is can then be used, for example, to transfomi human 293 cells. 
5 By using a Glycoprotein B encoding region devoid of the transmembrane spanning domain, the 
Glycoprotein B polypeptide is constitutively secreted into the culture medium at 0.15-0.25 pg/ce!l/day. 
In the presence of 0.6-6 methotrexate, production may be increased 10- to 100-fold, because of an 
amplification of the episomal recombinant. Glycoprotein B prepared in this way are appropriate, inter 
alia, for use in diagnosis, and to prepare vaccines protective against new and recun-ent herpes vims 
10 infections (Manservigi et al). 

Use of potyp9pti^9S to assess horpes virus infection 

The polypeptides embodied in this invention may t>e used to detect or assess the status of a 
15 herpes virus infection in an individual in several different applications. 

In one application, a polypeptide encoding a portion of a herpes vinjs Glycoprotein B is supplied 
as a reagent for an assay to detect the presence of antibodies that can specifically recognize it. Such 
antibodies may be present, for example, in the circulation of an individual with cun-ent or past herpes 
virus exposure. 

20 The preserjce of antitxxlies to Glycoprotein B in the circulation may provide a sensitive and early 

indication of viral infection. Since Glycoprotein B is a functional component of the viral envelope, it is 
produced in greater quantity than other transcripts sequestered within the viral particle. Its distribution 
is wider than transcripts that appear only transiently in the life cycle of the virus. Furthermore, it may 
expressed not only by intact virus, but also by non-infective products of virally infective cells, such as L- 

25 particles. Glycoprotein B from various species of herpes virus are knovm to be strongly immunogenic. 
Thus, detection of antitxKly to Glycoprotein B in an individual may be an indication of ongoing active 
herpes vinjs infection, latent infection, previous exposure, or treatment with a Glycoprotein B vaccine. 

Suitable clinical samples in which to measure antibody levels include serum or plasma from an 
individual suspected of having a gamma herpes viais infection. The presence of the antibody is 

30 detenmined. for example, by an immunoassay. 

A numt>er of immunoassay methods are established in the art for performing the quantitation of 
antit>ody using viral peptides (see, e.g., U.S. Patent No. 5,350,671: Houghton et al.). For example, the 
test sample potentially containing the specific antibody may be mixed with a pre-determined 
norvlimiting amount of the reagent polypeptide. The reagent may contain a directly attached label, such 

35 as an enzyme or a radioisotope. For a tiquid-priase assay, un reacted reagents are removed by a 
separation technique, such as filtration or chromatography. Alternatively, the antitxxJy in the sample 
may be first captured by a reagent on a solid phase. This may be, for example, the specific 
polypeptide, an antiimmunoglobulin, or Protein A. The captured antibody is then detected with a 
second reagent, such as the specific polypeptide, anti-immunoglobulin, or protein A with an attached 

40 label. At least one of the capture reagent or the detecting reagent must be the specific polypeptide. In 
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a third variation, cells or tissue sections containing the polypeptide may be overlaid first with the test 
sannple containing the antitxjdy. and then with a detecting reagent such as latieled anti-immunoglobulin. 
In all these examples, the amount of label captured in the complex is positively related to the amount of 
specific antibody present in the test sample. Similar assays can be designed in which antibody in the 
5 test sample competes with labeled antibody for binding to a limiting amount of the specific peptide. The 
amount of label in the complex is then negatively con-elated with the amount of specific antibody in the 
test sample. Results obtained using any of these assays are compared between test samples, and 
control samples from an uninfected source. 

By selecting the reagent polypeptide appropriately, antibodies of a desired specificity may be 

10 detected. For example. If the intact Glycoprotein B is used, or a fragment comprising regions that are 
conserved between herpes virus, then antibodies detected in the test samples may be virus specific, 
cross-reactive, or both. A multi-epitope reagent is prefen^d for a general screening assay tor 
antibodies related to herpes virus infection. To render the assay specific for antibodies directed either 
against RFHV or against KSHV, antigen peptides comprising non-conserved regions of the appropriate 

1 5 Glycoprotein B molecule are selected, such as those listed in Class III of Table 8. Preferably, a mixture 
of such peptides is used. To simultaneously detect antibodies against RFHV. KSHV, and closely 
related viruses of the gamma herpes family, but not sHVl and EBV. antigen peptides are selected with 
the properties of those listed in Class II of Table 8. Preferably, a mixture of such peptides is used. 

Arttibodies stimulated during a herpes virus infection may subside once the infection resolves, or 

20 they may persist as part of the immunological memory of the host. In the latter instance, antibodies due 
to cun^nt infection may be distinguished from antit}odies due to immunological memory by determining 
the class of the antibody. For example, an assay may be conducted in which antibody in the test 
sample is captured with the specific polypeptide, and then developed with labeled anti-tgM or anti-igG 
The presence of specific antibody in the test sample of the IgM class indicates ongoing infection, while 

25 the presence of IgG antibodies alone indicates that the activity is due to immunological memory of a 
previous infection or vaccination. 

Uso of polypeptides to design or screen anti-virat drugs 

30 Interfering with the Glycoprotein B gene or gene product would modify the infection process, or 

the progress of this disease. It is an objective of this invention to provide a method by which useful 
pharmaceutical compositions and methods of employing such compounds in the treatment of gamma 
herpes vinjs infection can be developed and tested. Particularly preferred are pharmaceutical 
compounds useful in treating infections by RFHV. KSHV and other members of the RFHV/KSHV 

35 subfamily. Suitable drugs are those that interfere with transcription or translation of the Glycoprotein B 
gene, and those that interfere with the twological function of the polypeptide encoded by the gene. It is 
rwt necessary that the mectwnism of interference be known; only tiiat the interference be preferential 
for reactions associated with the infectious process. 

Preferred dmgs include those that competitively interfere with the binding of the Glycoprotein B 

40 to its substrate on target cells, such as heparan sulfate and its analogs Also prefen^ed are daigs that 

SO 
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competitively interfere with any interaction of Glycoprotein B to other viral envelope components that 
may be necessary for the virus to exert one of Its biologic functions, such as penetration of target cells. 
Also preferred are molecules capable of cross-linking or otherwise immobilizing the Glycoprotein B, 
thereby preventing it from binding its substrate or performing any biological function that plays a role in 
5 vrraJ infectivity. 

This invention provides methods for screening pharmaceutical candidates to determine which 
are suitable for clinical use. The methods may be brought to bear on antiviral compounds that are 
currently known, and those which may be designed in the future. 

The method involves combining an active Glycoprotein B with the pharmaceutical candidate, and 

10 detamiining whether the biochemical function is altered by the pharmaceuticat candidate. The 
Glycoprotein B may be any fragment encoded by the Glycoprotein B gene of the RFHV/KSHV 
subfamily that has Glycoprotein B activity. Suitable fragments may be obtained by expressing a 
genetically engineered polypeptide encoding an active site of the molecule, or by cleaving the 
Glycoprotein B with proteases and purifying the active fragments. In a prefenied embodiment, the entire 

15 Glycoprotein B is provided. The reaction mixture will also comprise other components necessary to 
measure the biological activity of the protein. For example, in an assay to measure substrate binding, 
heparan sulfate or an analog thereof may be provided, perhaps linked to a solid support to facilitate 
measurement of the binding reaction. 

One emt>odiment of the screening method is to measure binding of the pharmaceutical candidate 

20 directly to the isolated Glycoprotein B, or a fragment thereof. Compounds that bind to an active site of 
the molecule are expected to interfere with Glycoprotein B activity. Thus, the entire Glycoprotein B. or a 
fragment comprising the active site, is mixed with the pharmaceutical candidate. Binding of the 
candidate can be measured directly, for example, by providing the candidate in a radiolabeled or 
stable-isotope lat>eled form. The presence of l3t>el bound to the Glycoprotein B can be determined, for 

25 example, by precipitating the Glycoprotein B with a suitable antibody, or by providing the molecule 
attached to a solid phase, and washing the solid phase after the reaction. Binding of the candidate to 
the Glycoprotein B may also be observed as a conformational change, detected for example by 
difference spectroscopy, nuclear magnetic resonance, or circular dichroism. Alternatively, binding may 
be determined in a competitive assay: for example. Glycoprotein B is mixed with the candidate, and 

30 then labeled nucleotide or a fragment of a regulatory subunit is added later. Binding of the candidate to 
the biochemically relevant site should inhibit subsequent binding of the labeled compound. 

A second embodiment of the screening method is to measure the ability of the pharmaceutical 
candidate to inhibit the binding of Glycoprotein B to a substrate or substrate analog. A prefen^d analog 
is heparin, coupled a solid support such as Sepharose^ beads. Inhibition may be measured, for 

35 example, by providing a radiolatMl to the Glycoprotein B. incubating it with the pharmaceutical 
candidate, adding the affinity resin, then washing and counting the resin to determine if the candidate 
has decreased the amount of radioactivity bound. Pharmaceutical candidates may also be tested for 
their ability to competitively interfere with interactions between Glycoprotein B and other herpes virus 
proteins. 
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A third embodiment of the screening method is to measure the ability of the pharmaceutical 
candidate to inhibit an activity of an active partide, such as a viral particle, mediated by Glycoprotein B. 
A particle is engineered to express Glycoprotein B, but not other components that are capable of 
mediating the same function. The ability of the particle to exhibit a biological function, such as 
5 substrate binding or membrane fusion, is then measured in the presence and absence of the 
pharmaceutical car>didate tjy providing an appropriate target. 

This invention also provides for the development of phanmaceuticats for the treatrnent of herpes 
infection by rational drug design. See, generally, Hodgson, and Erictcson et al. In this emtxsdiment, the 
three-dimensional structure of the Glycoprotein B is determined, either by predictive modeling based on 
10 the amino acid sequence, or preferably, by experimental determination. Experimental methods include 
antibody mapping, mutational ar\alysis, and the formation of anti-idiotypes. Especially preferred is X-ray 
crystallography. Knowing the three-dimensional structure of the glycoprotein, especially the orientation 
of important amino acid groups near the substrate binding site, a compound is designed de novo, or an 
existing compound is suitably modified. The designed compound will have an appropriate charge 
15 balance, hydrophobicity, and/or shape to permit it to attach near an active site of the Glycoprotein B, 
and sterically interfere with the normal biochemical function of that site. Preferably, compounds 
designed by this method are subsequently tested in a drug screening assay, such as those outlined 
atx>ve. 

20 AntibodiBS against Giycoprotaln B and thair praparation 

The amino acid sequence of the Glycoprotein B molecules embodied herein are foreign to the 
hosts they infect. Glycoprotein B from other species of herpes virus are known to be strongly 
immunogenic in mammals. Anti-Glycoprotein B is formed in humans, for example, as a usual 

25 consequence of infection with hCMV. By analogy, it is expected that Glycoprotein B of RFHV, KSHV, 
and other members of the RFhlV/KSHV subfamily will be immunogenic in mammals, including humans 
These expectations are supported by the observations descrit>ed in the Example section below. 

Antibodies against a fx>lypeptide are generally prepared by any method kr>own in the art. To 
stimulate antitx>dy production in an animal experimentally, it is often preferat>le to enhance the 

30 immunogenicity of a polypeptide by such techniques as polymerization with glutaraldehyde, or 
combining with an adjuvant, such as Freund's adjuvant. The immunogen is injected into a suitable 
experimental animal: preferably a rodent for the preparation of monoclonal antitx)dies; preferably a 
larger animal such as a rabbit or sheep for preparation of polyclonal antibodies. It is preferable to 
provide a second or booster injection after about 4 weeks, and begin harvesting the antibody source no 

35 less than about 1 week later. 

Sera harvested from the immunized animals provide a source of polyclonal antibodies. Detailed 
procedures for purifying specific antitxsdy activity from a source material are known within the art. If 
desired, the specific antibody activity can be further purified by such techniques as protein A 
chromatography, ammonium sulfate precipitation, ion exchange chromatography, high-performance 
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liquid chromatography and immunoaffinity chromatography on a column of the immunizing polypeptide 
coupled to a solid support. 

Polyclonal antitiodies raised by immunizing with an intact Glycoprotein B or a fragment 
comprising conserved sequences may be cross-reactive between herpes viruses. Antibodies that are 
5 vinjs or subfamily specific may be raised by immunizing with a suitably spedfic antigen, such as those 
listed above in Table 8. Alteniatively. polyclonal antibodies raised against a larger fragment may be 
rendered specific by removing unwanted activity against other virus Glycoprotein B's. for example, by 
passing the antibodies over an adsorbent made from Glycoprotein 8 and collecting the unbound 
fraction. 

1 0 Alternatively, immune cells such as splenocytes can be recovered from the immunized animals 

and used to prepare a monoclonal antibody-producing cell line. See, for example, Hanow & Lane 
(1938), U.S. Patent Nos. 4.472,500 (Milstein et al.). and U.S. 4.444.887 (Hoffman et al.). 

Briefly, an antibody-producing tine can t>e produced inter alia by cell fusion, or by transforming 
antibody- producing celts with Epstein Barr Virus, or transforming with oncogenic DNA. The treated 

15 cells are cloned and cultured, and clones are selected that produce antitxxiy of the desired specificity. 
Specificity testing can be performed on culture supematants by a number of techniques, such as using 
the immunizing polypeptide as the detecting reagent in a standard immunoassay, or using cells 
expressing the polypeptide in immunohistochemistry. A supply of monoclonal antit}ody from the 
selected clones can be purified from a large volume of tissue culture supematant, or from the ascites 

20 fluid of suitably prepared host animals injected with the clone. 

Effective variations of this method include those in which the immunization with the polypeptide 
is performed on isolated cells. Antibody fragments and other derivatives can be prepared by methods 
of standard protein chemistry, such as subjecting the antitxxjy to cleavage with a proteolytic enzyme. 
Genetically engineered variants of the antibody can t>e produced by obtaining a polynucleotide 

25 encoding the antibody, and applying the general methods of molecular biology to irtroduce mutations 
and translate the variant. 

Monoclonal antibodies raised by injecting an intact Glycoprotein B or a fragment comprising 
conserved sequences may t>e cross-reactive between herpes viruses. AntitxxJies that are virus or 
subfamily specific may be raised by immunizing with a suitably specific antigen, as may be selected 

30 from Table 8. Alternatively, virus- specific clones may be selected from the cloned hybridomas by using 
a suitable antigen, such as one selected from Class 111 of Table 8. in the screening process. 

Specific antibodies against herpes virus Gtycoprotein B have a number of uses in 
developmental, diagrrastic and therapeutic work. For example, antibodies can be used in drug 
screening (see U.S. Patent No. 5,120.639). They may also be used as a component of a passive 

35 vaccine, or for detecting herpes virus in a biological sample and for drug targeting, as described in the 
following sections. 

AntMdiotypes relating to Glycoprotein B may also be prepared. This is accomplished by first 
preparing a Glycoprotein B antibody, usually a monoclonal antibody, according to the aforementioned 
methodology. The antitxxiy is then used as an immunogen in a volunteer or experimental animal to 
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raise an anti-idiotype. The anti-idiotype may be either monoclonal or polyclonal, and its development is 
generally according to the methodology used for the first antibody. Selection of the anti-idiotype or 
hybridoma clones expressing anti-idiotype is done using the immunogen antibody as a positive 
selector, and using antibodies of unrelated specificity as negative selectors. Usually, the negative 
5 selector antibodies will be a polyclonal immunoglobulin preparation or a pool comphsing monoclonal 
immunoglobulins of the same Immunogtobulin class and subclass, and the same species as the 
immunogen antibody. An antt-idrotype may be used as an alternative component of an active vaccine 
against Glycoprotein B. 

1 0 Use of antibodies for detecting Glycoprotein B in bloiogical sampies 

Antibodies specific for Glycoprotein B can be used to detect Glycoprotein B polypeptides and 
fragments of viral ohgin that may be present, for example, in solid tissue samples and cultured cells. 
Immunohistologicat techniques to carry out such determinations will be obvious to a practitioner of 

15 ordinary skill. Generally, the tissue is preserved by a combination of techniques which may include 
freezing, exchanging into different solvents, fixing with agents such as parafomialdehyde, drying with 
agents such as alcohol, or emt>edding in a commercially available medium such as paraffin or OCT. A 
section of the sample is suitably prepared and overlaid with a primary antibody specific for the protein. 
The primary antibody may be provided directly with a suitable lat>el. More frequently, the primary 

20 antibody is detected using one of a number of developing reagents which are easily produced or 
available commercially. Typically, these developing reagents are ant>-immunoglot>ulin or protein A, and 
they typically bear labels wtiich include, but are not limited to. fluorescent markers such as fluorescein, 
enzymes such as peroxidase that are capable of precipitating a suitable chemical compound, electron 
dense markers such as colloidal gold, or radioisotopes such as ^^^1. The section is then visualized 

25 using an appropriate microscopic technique, and the level of labeling is compared between the 
suspected virally infected and a control cell, such as cells surrounding the area of infection or taken 
from a remote site. 

Proteins encoded by a Glycoprotein B gene can also be detected in a standard quantitative 
immunoassay. If the protein is secreted or shed from infected cell in any appreciable amount, it may be 

30 detectable in plasma or serum samples. Alternatively, the target protein may t>e solubtlized or extracted 
from a solid tissue sample. Before quantitating, the protein may optionally be affixed to a solid phase, 
such as by a blot technique or using a capture antibody. 

A number of immunoassay methods are established in the art for performing the quantitation. For 
example, the protein may be mixed with a pre-determined non-limiting amount of the reagent antibody 

35 specific for the protein. The reagent antibody may contain a directly attached label, such as an enzyme 
or a radioisotope, or a second labeled reagent may be added, such as anti-tmmunoglobulin or protein 
A. For a solid-F>hase assay, unreacted reagents are removed by washing. For a liquid-phase assay, 
unreacted reagents are removed by some other separation technique, such as filtration or 
chromatography. The amount of label captured in the complex is positively related to the amount of 

40 target protein present in the test sample A variation of this technique is a competitive assay, in which 
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the target protein competes with a labeled analog for binding sites on the specific antibody. In this 
case, the amount of label captured is negatively related to the amount of target protein present in a test 
sample. Results obtained using any such assay are compared between test samples, and control 
samples from an uninfected source. 

5 

Use of antibodies for drug targeting 

An example of how antibodies can be used In therapy of herpes virus infection is in the specific 
targeting of effector components. Virally infected cells generally display peptides of the virus, especially 

1 0 proteins expressed on the outside of the viral envelope. The peptide therefore provides a marker for 
infected celts that a specific antibody can bind to. An effector component attached to the antibody 
therefore becomes concentrated near the infected cells, improving the effect on those cells and 
decreasing the effect on uninfected cells. Furthemnore. if the antibody is able to induce endocytosis. 
this will enhance entry of the effector into the cell interior. 

15 For the purpose of targeting, an antibody specific for the viral polypeptide (in this case, a region 

of a Glycoprotein B) is conjugated with a suitable effector component, preferably by a covalent or 
high-affinity bond. Suitable effector components in such compositions include radionuclides such as 
"^1, toxic chemicals, and toxic peptides such as diphtheria toxin. Another suitable effector component 
is an antisense polynucleotide, optionally encapsulated in a liposome. 

20 

Diagnostic fdts 

Diagnostic procedures using the polynucleotides, oligonucJeotides, peptides, or antibodies of this 
invention may be performed by diagnostic laboratories, experimental latwratories, practitioners, or 

25 private individuals. This invention provides diagnostic kits which can be used in these settings. The 
presence of a herpes virus in the individual may be manifest in a clinical sample obtained from that 
individual as an alteration in the DIslA. RNA. protein, or antibodies contained in the sample. An 
alteration in one of these components resulting from the presence of a herpes virus may take the form 
of an increase or decrease of the level of the component, or an alteration in the form of the component. 

30 compared with that in a sample from a healthy individual. The clinical sample is optionally pre-treated 
for enrichment of the target being tested for. The user then applies a reagent contained in the kit in 
order to detect the changed level or alteration in the diagnostic component. 

Each kit necessarily comprises the reagent which renders the procedure specific: a reagent 
polynucleotide, used for detecting target DHA or RNA: a reagent antibody, used for detecting target 

35 protein: or a reagent polypeptide, used for detecting target antibody that may be present in a sample to 
be analyzed. The reagent is supplied in a solid form or liquid buffer that is suitable for inventory 
storage, and later for exchange or addition into the reaction medium when the test is performed 
Suitable packaging is provided. The kit may optionally provide additional components that are useful in 
the procedure. These optional components irKdude buffers, capture reagents, developing reagents, 
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labels, reacting surfaces, means for detection, control samples, instnjctions. and interpretive 
infonnation. 

Other members of the RFHV/KSHV subfamily 

5 

RFHV and KSHV are exemplary members of the RFHV/KSHV subfamily. This invention 
embodies polynucleotide sequerKes encoding Glycoprotein B of other memt>ers of the subfamily, as 
defined herein. The consensus-degenerate gamma herpes virus oligonucleotide Type 1 and 2 primers, 
and the methods descritwd herein are designed to be suitable for characterization of the corresponding 

1 0 polynucleotide fragment of other members of the RFHV/KSHV subfamily. One such member is another 
virus infecting monkeys, designated RFHV2. A segment of the Glycoprotein encoding sequence for this 
virus was cloned from RF tissue obtained from a Macaca mulatta monkey, as described in Example 12. 

In order to identify and characterize other members of the family, reagents and methods of this 
invention are applied to DNA extracted from tissue samples suspected of being infected with such a 

1 5 virus. 

Suitable sources of DNA for this purpose include biological samples obtained from a wide range 
of conditions occurring in humans and other vertebrates. Preferred are conditions in which the agent is 
suspected of being lymphotrophic, similar to other members of the gamma herpes virus subfamily; for 
example, infectious mononucleosis of non-EBV origin. More preferred are conditions which resemble in 

20 at least one of their clinical or histological features the conditions with which RFHV or KShlV are 
associated. These include: a) conditions in which fibroproliferation Is part of the pathology of the 
disease, especially in association with collagen deposition, and especially where the fibrous tissue is 
disorganized: b) conditions involving vascular dysplasia: c) conditions involving malignant 
transformation, especially but not limited to cells of lymphocyte lineage; d) conditions for which an 

25 urxSerlylng immunodeficiency contributes to the frequency or severity of the disease; e) conditions 
which arise idiopathically at multiple sites in an organ or in the body as a whole; f) conditions which 
epidemiological data suggests are associated with an infectious or environmental agent. Conditions 
which fulfill more than one of these criteria are comparably more preferred. Some examples of 
especially preferred conditions include retroperitoneal fibrosis. nodular fibromatosis, 

30 pseudosarcomatous fibromatosis, fibrosarcomas, sclerosing mesenteritis, acute respiratory disease 
syndrome, idiopathic pulmonary fibrosis, diffuse proliferative glomerulonephritis of various types, 
gliomas, glioblastomas, gliosis, and all types of leukemias and lymphomas. 

The type of tissue sample used will depend on the clinical manifestations of the condition. 
Samples more likely to contain a virus associated with the condition may be taken from the site involved 

35 in the disease pathology, or to which there is some other evidence of viral tropism. Peripheral blood 
mononuclear cells of an infected individual may also act as a canier of an RFHV/KSHV subfamily virus. 
KSHV has been detected in PBMC of both Kaposi's Sarcoma (Moore et al. 1995b} and Castleman s 
disease (Dupin et al.). Other suitable sources are cell cultures developed from such sources, and 
enriched or isolated preparations of virus obtained from such sources. For negative control samples, 
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tissue may be obtained from apparently unaffected sites on the same individuals, or from matched 
ifKJividuals who apparently do not suffer from the condition. 

The process of identification of members of the RFHV/KSHV subfamily preferably involves the 
use of the methods and reagents provided in this invention, either singularly or in combination. 
5 One method involves amplifying a polynucleotide encoding a herpes virus Glycoprotein B from 

DMA extracted from the sample. This can be performed, for example, by amplifying the polynucleotide 
in a reaction such as a PCR. In one variation, the amplification reaction is primed using broadly specific 
consensus-degenerate Type 1 oligonucleotides, such as those shovm in Table 4. This will amplify 
herpes vimses, primarily of the gamma type. Since the RFHV/KSHV subfamily is a subset of gamma 

10 herpes vinjses. Glycoprotein B sequences detected by this variation need to be characterized further to 
determine whether they fall into the RFHV/KSHV subfamily. In a second variation, the amplification is 
primed with RFHV or KSHV specific Type 3 oligonucleotides, such as those listed in Table 7. or other 
Glycoprotein B polynucleotide segments taken from these viruses. The amplification is conducted 
under conditions of medium to low stringency, so that the oligonucleotides will cross-hybridize writh 

15 related species of viruses. In a more prefen-ed variation, the amplification reaction is primed using 
RFHV/KSHV subfamily specific Type 2 oligonucleotides, such as those listed in Table 6. Under 
appropriate hybridization conditions, these primers wrill preferentially amplify Glycoprotein B from herpes 
viruses in the subfamily. 

Prefen-ed members of the subfamily detected using a Glycoprotein B polynucleotide probe are 

20 those that are at least 65% identical vinth the RFHV or KSHV Glycoprotein B nucleotide sequence 
between residues 36 and 354 of SEQ. ID N0:1 or SEQ. ID N0:3. f^ore prefen-ed are those that are at 
least atx>ut 67% idenbcal; more preferred are those at least about 70% identical; more preferred are 
those that are at least atjout 80% identical; even more preferred are those about 90% identical or more. 
Members of the subfamily can also be identified by performing a hyl>ridization assay on the 

25 polynucleotide of the sample, using a suitable probe. The polynucleotide to t>e tested may optionally be 
amplified before conducting the hybridization assay, such as by using Type 1 or Type 2 
oligonucleotides as primers. The target is then tested in a hybridization reaction with a suitable labeled 
probe. The probe preferably comprises at least 21 nucleotides, preferably at least about 25 
nucleotides, more preferably at least about 50 nudectides contained the RFHV or KSHV Glycoprotein B 

30 sequence in SEQ. ID N0S:1 and 3. Even more preferably, the probe comprises nucleotides 36-354 of 
SEQ. ID NOS:1 or 3. Other preferred probes include Type 2 oligonucleotides, such as those shown in 
Table 6. Hybridization conditions are selected to permit the probe to hybridize with Glycoprotein B 
polynucleotide sequences from the RFHV/KSHV subfamily, but not previously sequenced herpes 
viruses; particulariy sHVI. bHV4. eHV2. mHV68. hEBV. hCMV. hHV6. hVZV. and HSV1. Formation of 

35 a stable duplex with the test polynucleotide under these conditions suggests the presence of a 
polynucleotide in the sample derived from a member of the RFHV/KSHV subfamily. 

Members of the subfamily can also be identified by using a Class II antibody, the preparation of 
which was outlined eariier. A Class II antitwdy cross-reacts between antigens produced by members of 
the RFHV/KSHV subfamily, but not vnth other antigens, including those produced by heroes viruses not 
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members of the subfamily. The test for new subfamily memers is performed, for example, by using the 
antibodies in an immunohistochemistry study of tissue sections prepared from individuals with the 
conditions listed atx}ve. Positive staining of a tissue section with the antibody suggests the presence of 
Glycoprotein B in the sample from a member of the RFHV/KSHV subfamily, probably because the 
5 tissue is infected with the virus. If. in addition, the tissue section is norvreactive with RFHV and KSHV 
specific Class III antibodies, the Glycoprotein B in the tissue may be derived from another member of 
the subfamily. Similarly, if Class II antibodies are found in the circulation of an individual, the individual 
may have been subject to a present or past infection with a member of the RFHV/KSHV subfamily. 

Once a putative new virus is identified by any of the aforementioned methods, its membership in 

10 the RFHV/KSHV subfamily may be confimried by obtaining and sequencing a region of me Glycoprotein 
B gene of the vims, and comparing it with that of RFHV or KSHV according to the subfamily definition. 
For new members of the RFHV/KSHV sutrfamily. other embodiments of this invention may be brought 
into play for purposes of detection, diagrtosis, and pharmaceutical development. Adaptation of the 
embodiments of the invention for a new subfamily member, If required, is expected to be minor in 

1 5 nature, and will be obvious based on the new sequence data, or a matter of routine adjustment. 

Ahend forms of Glycoprotein B from the RFHV/KSHV subfamily 

This invention also emt>odies altered forms of Glycoprotein B of the RFhlV/KSHV subfamily. 

20 A number of studies on both naturally occurring and induced mutaticKis of the Glycoprotein B of 

HSV1 and hCMV point to a role of certain regions of the molecule for its the various biochemical 
functions. See. for example, Reschke et al. and Baghian et at. for a role of carboxy-terminal amino 
adds in fusion; Shiu et al. and Pellett et al. for epitopes for neutralizing antibodies; Gage et al. for 
regions of the molecule involved in syncytium formation: Navarro et al. (1992) for regions Involved in 

25 virus penetration and cell-to-celt spread: Quadri et al. and Novarro et al. (1991) for regions involved in 
intracellular transport of Glycoprotein B during biosynthesis. 

Some of the residues described may be conserved l^etween the Glycoprotein B molecules of the 
viruses investigated previously, and the Glycoprotein B molecules descritwd herein. By analogy, 
mutation of the same residue in the Glycoprotein B of the RFHV/KSHV subfamily is expected to have a 

30 similar effect as described for other viruses. Alternatively, functional regions of different Glycoprotein B 
molecules may be combined to produce Glycoprotein B recombinants with altered function. For 
example, replacing the Glycoprotein B gene in a pathogenic virus with that of a non- pathogenic virus 
may reduce the pathogenicity of the recombinant (Kostal et al.). Either mutation and recombination of 
Glycoprotein 6 of the RFHV/KSHV herpes virus subfamily may lead to attenuated strains, in which 

35 either the infectivity. replication activity, or pathogenicity is reduced. Alterations in the Glycoprotein B 
sequence which have these effects are contemplated in this invention. 

Attenuated sti^ins of herpes viruses are useful, for example, in developing polyvalent vaccines. 
It is desirable, especially in developing countries, to provide prophylactic vaccines capable of 
stimulating the immune system against several potential pathogens simultaneously. Vinjses that are 

40 engineered to express immunogenic peptides of several different pathogens may accomplistt this 
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purpose. Herpes viruses may be especially suitable vectors, because the large genome may easily 
accommodate several kilobases of extra DNA encoding the peptides. Ideally, the viral vector is 
sufficiently intact to exhibit some biological activity and attract the attention of the host's immune 
system, while at the same time being sufficiently attenuated not to cause significant pathology. Thus, 
5 an attenuated vims of the RFHV/KSHV subfamily may be useful as a vaccine against like virulent 
fomis. and may t>e modified to express additional peptides and extend the range of immune protection. 

Arwther use for attenuated forms of herpes viruses Is as delivery vehicles for gene therapy 
(Latchman et al.. Glorioso et al ). In order to be effective, polynucleotides in gene therapy must be 
delivered to the target tissue site. In the treatment of fibrotic diseases, malignancies and related 
10 conditions, attenuated viral vectors of the RFHV/KSHV subfamily may be preferable over other 
targeting mechanisms, including other herpes viruses, since they have the means by which to target 
towards the affected tissues. In this embodiment, the virus is first attenuated, and then modified to 
contain the polynucleotide that is desired for gene therapy, such as those that are outlined in a previous 
section. 

15 

Giycopmt9ln 8 in RFHV/KSHV subfamily vaccines 

Because of its prominence on the envelope of the infectious virus and infected cells, glycoprotein 
B is predicted to be a useful target for immune effectors. Herpes virus Glycoprotein B is generally 

20 immurKjgenic. giving rise to antibodies capable of neutralizing the virus and preventing it from entering 
a replicative phase. In addition, Glycoprotein B is capable of eliciting a T-cell response, which may help 
eradicate an ongoing viral infection by attacking sites of viral replication in host cells. 

This invention embodies vaccine compositions and methods for using them in the prevention and 
management of infection by viruses from the RFHV/KSHV subfamily. 

25 One series of embodiments relate to active vaccines. These compositions are designed to 

stimulate an immune response in the individual being treated against Glycoprotein B. They generally 
comprise either the Glycoprotein B molecule, an immunogenic ft^gment or variant thereof, or a cell or 
particle capable of expressing the Glycoprotein B molecule. Alternatively, they may comprise a 
polynucleotide encoding an immunogenic Glycoprotein B fragment (Horn et al.), preferably in the form 

30 of an expression vector. Polynucleotide vaccines may optionally comprise a delivery vehicle like a 
liposome or viral vector particle, or may be administered as naked DNA. 

Vaccine compositions of this invention are designed in such a way that the immunogenic 
fragment is presented to stimulate the proliferation and/or biological function of the appropriate immune 
cell type. Compositions directed at eliciting an antibody response comprise or encode B cell epitopes. 

35 and may also comprise or encode other elements that enhance uptake and display by 
antigen-presentabon cells, or that recnjit T cell help. Compositions directed at eliciting helper T cells, 
especially CD4* cells, generally comprise T cell epitopes that can be presented in the context of class II 
histocompatibility molecules. Compositions directed at stimulating cytotoxic T cells and their 
precursors, especially CDS* cells, generally comprise T cell epitopes that can toe presented in the 

40 context of class I histocompatibility molecules. 
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In the ptxJtection of an individual against a future exposure with herpes vims, an antibody 
response may be sufficient. Prophylactic compositions preferably compnse components that elicit a B 
cell response. Successful eradication of an ongoing herpes vinjs infection may involve the participation 
of cytotoxic T cells. T helper-lnducer cells, or both. Infections for treating ongoing infection preferably 
5 comprise components capable of eliciting both T helper cells and cytotoxic T celts. Compositions that 
preferentiaHy stimulate Type 1 helper CThi) cells over Type 2 helper (Th^) cells are even more prefen-ed. 
The preparation and testing of suitable compositions for active vaccines is outlined in the sections that 
follow. 

Another series of embodiments relates to passive vaccines and other materials for adoptive 

1 0 transfer. These compositions generally comprise specific immune components against Glycoprotein B 
that are immediately ready to participate in viral neuti^Iization or eradication. Therapeutic methods 
using these compositions are preferred to prevent pathologic consequences of a recent viral exposure. 
They are also preferred in immunocompromized Individuals incapable of mounting a sufficient immune 
response to an active vaccine. Such individuals include those with congenital immunodeficiencies. 

15 acquired immunodeficiencies (such as those infected with HIV or on kidney dialysis), and those on 
immunosuppressive therapies, for example, with corticosteroids. 

Suitable materials for adoptive transfer include specific antibody against Glycoprotein B, as 
described below. Also included are the adoptive transfer of immune cells. For example. T celts 
reactive against Glycoprotein B may be talcen from a donor individual, optionally cloned or cultured in 

20 vitro, and then transferred to a histocompatible recipient, f^ore preferably, the transfened cells are 
autologous to the recipient, and stimulated in vitro. Thus, T cells are purified from the individual to be 
treated, cultured in the presence of immunogenic components of Glycopratein B and suitable 
stimulatory factors to elicit virus-specific cells, and then readmtnistered. 

Certain compositions embodied herein may have properties of both active and passive vaccines. 

25 For example. Glycoprotein B antibody given by adoptive transfer may confer immediate protection 
against herpes virus, and may also stimulate an ongoing response, through an anti-idiotype network, or 
by enhancing the immune presentation of viral antigen. 

Vaccines comprising Glycoprotoin B pofypeptides 

30 

Specific components of vaccines to stimulate an immune response against Glycoprotein B 
include the intact Glycoprotein B molecule, and fragments of Glycoprotein B that are immunogenic in 
the host. 

Intact Glycoprotein B and longer fragments thereof may t>e prepared by any of the metnods 
35 described eariier, especially purification from a suitable expression vector comprising a Glycoprotein B 
encoding polynucleotide. Isolated Glycoprotein B from other viral strains stimulate a protective immune 
response (See U.S. Patent Nos. 5.171,568: Burke et al.}. Preferred fragments comprise regions of the 
molecule exposed on the outside of the intact viral envelope: located within about 650 amino acids of 
the N'terminal of the mature protein. 

60 
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Gtycosylation of Glycoprotein B is not required for immmogenicity (O'Donnell et al.). Hence, 
glycosylated and unglycosylated forms of the molecule are equally preferred. Gtycosylation may be 
determined by standard techniques; for example, comparing the mobility of the protein in SDS 
polyacrytamide gel electrophoresis before and after treating with commercially available 
5 endogtycosidase type F or H. 

Smaller fragments of 5-50 amino acids comprising particular epitopes of Glycoprotein B are also 
suitable vaccine components. These may be prepared by any of the methods described earlier; most 
conveniently, by chemical synthesis. Prefenred fragments are those which are immunogenic and 
expressed on the outside of the viral envelope. Even more prefen^d are fragments implicated in a 
10 biological function of Glycoprotein 8. such as binding to cell surface receptors or penetration of the vinjs 
into a target cell. 

Immunogenicity of various epitopes may be predicted by algorithms known in the art. Antigenic 
regions for B cell receptors may be determined, for example, by identifying regions of variable polarity 
(Hopp et al., see Example 9). Antigenic regions for T cell receptors may be determined, for example. 

15 by identifying regions capable of forming an amphipathic helix in the presentation groove of a 
histocompatibility molecule. Antigenic regions may also be identified by analogy with Glycoprotein 8 
molecules of other viral species. See, e.g., Sanchez-Pescador et al. and Mester et al.. for B cell 
epitopes of HSV1; Liu et al. for HLA-rcstricted helper T cell epitopes of hCMV; and Hanke et al. for 
cytotoxic T lymphocyte epitopes of HSV1 . 

20 Immunogenicity of various epitopes may be measured experimentally by a number of different 

techniques. Generally, these involve preparing protein fragments of 5-20 amino acids in length 
comprising potential antigenic regions, and testing them in a specific bioassay. Fragments may be 
prepared by CNBr and/or proteolytic degradation of a larger segment of Glycoprotein B. and purified, for 
example, by gel electrophoresis and blotting onto nitrocellulose (Demotz et aL). Fragments may also 

25 be prepared by standard peptide synthesis (Schumacher et at.. Liu et al.). (n a preferred method, 
consecutive peptides of 12 amino acids overlapping by 8 residues are synthesized according to the 
entire extracellular domain of the mature Glycoprotein B molecule, using F-Moc chemistry on a nylon 
membrane support {see Example 11). 

Reactivity against the prepared fragment can then be determined in samples from individuals 

30 exposed to the intact virus or a Glycoprotein B comporwnt. The individual may have been 
experimentally exposed to the Glycoprotein B component by deliberate administration. Altematively. 
the individual may have a naturally occurring viral infection, preferably confirmed by a positive 
amplification reaction using a virus-specific oligonucleotide prol>e to Glycoprotein B or DNA 
Polymerase. Blood samples are obtained from the individual, and used to prepare serum, T cells, and 

35 peripheral blood mononuclear cells (PBr^^C) by standard techniques. 

Serum may be tested for the presence of Glycoprotein B specific antibody in an enzyme-linked 
immuno5ort>ant assay. For example, peptides attached to a solid support such as a nylon membrane 
are incubated with the serum, washed, incubated with an enzyme-linked anti-immunoglobulin, and 
developed using an enzyme substrate. The presence of antibody against a particular Glycoprotein B 
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peptide is indicated by a higher level of reaction product in the test well than in a well containing an 
unrelated peptide (Example 11). 

Lymphocyte preparations may be tested for the presence of Glycoprotein B specific helper T 
cells in a proliferation assay. Approximately 2x10* helper T cells are incubated with the peptide at 1 0" 
5 ^-lO"^ M in the presence of in^diated autologous or in^diated 10^ PBMC as antigen presenting cells for 
about 3 days. ['H]Thymidine is added for about the last 16 h of culture, The cells are then harvested 
and washed. Radioactivity in the washed cells at a level of about 10 fold over those cultured in the 
absence of peptide reflects proliferation of T cells specific for the peptide (Liu et at.). If desired, cells 
with a CD3*4*8" phenotype may be cloned for further characterization of the helper T celt response. 

10 Lymphocyte preparations may be tested for the presence of Glycoprotein B specific cytotoxic T 

cells in a **Cr release assay. Targets are prepared by infecting allogeneic cells wrth a herpes vinjs 
comprising an expressible Glycoprotein B gene. Alternatively, allogeneic celts transfected with a 
Glycoprotein B expression vector may be used. The targets are incubated with ^^Cr for about 90 min at 
37*C and then washed. About 5x10* target cells are incubated with lO'^-IO'' M of the peptide and 

15 0.1-2 X 10* test T cells for at>out 30 min at 37'C. Radioactivity released into the supernatant at a level 
substantially above that due to spontaneous lysis reflects CTL activity. If desired, cells with a CDSM'B* 
. phenotype may be cloned for further characterization of the CTL response. 

Glycoprotein B peptides may optionally t>e combined in a vaccine with other peptides of the 
same virus. Suitable peptides include peptides of any of the other components of the herpes virus, 

20 such as Glycoproteins C, D, H, E, t, J. and G. Glycoprotein B peptides may also optionally be 
combined with immunogenic peptides from different viruses to provide a multivalent vaccine against 
more than one pathogenic organism. Peptides may be combined by preparing a mixture of the 
peptides in solution, or by synthesizing a fusion protein in which the vanous peptide components are 
linked. 

25 Forms of Glycoprotein B comprising suitable epitopes may optionally be treated chemically to 

enhance their immunogenlclty, especially if they comprise 100 amino acids or less. Such treatment 
may include cross-linking. for example, with glutaraldehyde: linking to a protein carrier, such as keyhole 
limpet hemocyanin (KLH) or tetanus toxoid. 

The peptide or peptide mixture may be used neat, but normally will be combined with a 

30 physiologically and pharmacologically acceptable excipient. such as water, saline, physiologically 
buffered saline, or sugar solution. 

In a preferred emt}odiment. an active vaccine also comprises an adjuvant which enhances 
presentation of the immunogen or otherwise accentuates the immune response against the 
immunogen. Suitable adjuvants include alum, aluminum hydroxide, beta-2 microglobulin (WO 

35 91/16924; Rock et al ). muramyl dipeptides. muramyl tripeptides (U.S. Patent No. 5.171.56B: Burke et 
al.), and monophosphoryt lipid A (U.S. Patent No 4,436,728: Ribi et al.: and WO 92/16231: Francotte et 
al ). Immunomodulators such as Interteukin 2 may also be present. The peptide and other components 
(if present) are optionally encapsulated in a liposome or microsphere. For an outline of the 
experimental testing of various adjuvants, see U.S. Patent No. 5,171,568 (Bur^e et al ). A variety of 
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adjuvants may be efficacious. The choice of an adjuvant will depend at toast in part on the stability of 
the vaccine in the presence of the adjuvant, the route of administration, and the regulatory acceptability 
of the adjuvant, particularly when intended for human use. 

Polypeptide vaccines generally have a broad range of effective latitude. The usual route of 
5 administration is intramuscular, but preparations may also be developed which are effective given by 
other routes, including intravenous, intraperitoneal, oral, intranasal, and by inhalation. The total amount 
of Glycoprotein B polypeptide per dose of vaccine when given intramuscularly will generally be about 10 
ixg to 5 mg: usually about 50 |ig to 2 mg: and more usually about 100 to 500 ^g. The vaccine is 
preferably administered first as a priming dose, and then again as a boosting dose, usually at least four 
1 0 weeks later. Further boosting doses may be given to enhance or rejuvenate the response on a periodic 
basis. 

Vaccines comprising viral particles expressing Glycoprotein B 

15 Active vaccines may also be prepared as particles that express an immunogenic epitope of 

Glycoprotein 8. 

One such vaccine comprises the L-particle of a recombinant herpes virus (see U.S. Patent No. 
5,284.122: Cunningham et al.). The genome of the recombinant virus is defective in a capsid 
component, or otherwise prevented from fomiing intact virus; however, it retains the ability to make 

20 L-particles. The genome is engineered to include a Glycoprotein B encoding polynucleotide of the 
present invention operatively linked to the controlling elements of the recombinant vims. The virtis is 
then grown, for example, in cultured cells, and the particles are purified by centrifugation on a suitable 
gradient, such as FICOLL^". Such preparations are free of infective vims, and capable of expressing 
peptide components of a numt>er of different desirable epitopes. 

25 Another such vaccine comprises a live vims that expresses Glycoprotein B of the present 

invention as a heterologous antigen. Such vimses include HIV. SIV. FIV. equine infectious anemia, 
visna vims, and herpes vimses of other species. The vims should be naturally non-pathogenic in the 
species to be treated; or alternatively, it should be attenuated by genetic modification, for example, to 
reduce replication or vimlence. Herpes vims may be attenuated by mutation of a gene involved in 

30 replication, such as the DNA Polymerase gene. Herpes vims may also be attenuated by deletion of an 
essential late-stage component, such as Glycoprotein H (WO 92/05263: Inglis et al.). A live vaccine 
may be capable of a low level of replication in the host, particularly if this enhances protein expression, 
but not to the extent that it causes any pathological manifestation in the subject being treated. 

A prefen-ed viral species for preparing a live vaccine is adenovlms. For human therapy, human 

35 adenovims types 4 and 7 have been shown to have no adverse affects, and are suitable for use as 
vectors. Accordingly, a Glycoprotein B polynucleotide of the present invention may be engineered, for 
example, into the E1 or E3 region of the viral genome. It is known that adenovims vectors expressing 
Glycoprotein B from HSV1 or HSV2 stimulate the production of high tiler vims- neutralizing antibody 
(McDermott et al.). The response protects experimental animals against a lethal challenge with the 

40 respective live vims. 

(p3 
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Also preferred as a virus for a live recombinant vaccine ts a recombinant pox virus, especially 
vaccinia. Even more preferred are strains of vaccinia virus wfilcb have been modified to inactivate a 
non-essential virulence factor, for example, by deletion or insertion of an open reading frame relating to 
ttie factor (U.S. Patent No. 5,364,773: Paoletti et al.). To prepare the vaccine, a Glycoprotein B 
encoding polynucleotide of the present invention is genetically engineered into the viral genome and 
expressed under control of a vaccinia vinjs promoter. Recombinants of this type may be used directly 
for vaccination at at»ut 10^-10* plaque-forming units per dose. Single doses may be sufficient to 
stimulate an antibody response. Vaccinia virus recombir^ants comprising Glycoprotein B of HSV1 are 
effective in protecting mice against lethal HSV1 infection (Cantin et al ). 

Arwther vaccine In this category is a self-assembfing replicatron^efective hybrid virus. See, for 
example. WO 92/05263 (Inglis et a!.). The particle may contain, for example, capsid and envelope 
glycoproteins, but not an intact viral genome. As embodied in this invention, one of the glycoproteins in 
the viral envelope is Glycoprotein B. 

In a preferred embodiment, the particle is produced by a viral vector of a first species, having a 
sufficrent segment of the genome of that species to replicate, along with encoding regions for a capsid 
and an envelope from a heterologous species (see U.S. Patent No. 5,420,026: Payne). Genetic 
elements of the first species are selected such that infection of eukaryotic cells with the vector produces 
capsid and envelope glycoproteins that self-assemble into replication-defective particles. In a variant of 
this embodiment, polynucleotides encoding the capsid and envelope glycoproteins are provided in two 
separate vectors derived from the first viral species. The capsid encoding regions may be derived from 
a lentivirus, such as HIV, SIV, FIV, equine infectious anemia virus, or visna virus. The envelope 
encoding regions comprise a Glycoprotein B encoding polynucleotide of the present invention. 
Preferably, all envelope components are encoded by a herpes virus, particularly of the RFHV/KSHV 
subfamily. The defective viral particles are obtained by infecting a susceptible eukaryotic cell line such 
as BSC-40 with the vector(s) and harvesting the supernatant after about 16 hours. Viral particles may 
be further purified, if desired, by centrifugation through a sucrose cushion. Particles may also be 
treated with 0.8% formalin at 40*C for 24 hours prior to administration as a vaccine. 

Vaccines comprising a live attenuated virus or virus analog may be lyophllized for refrigeration. 
Diluents may optionally include tissue culture medium, sorbitol, gelatin, sodium bicartwnate, albumin, 
gelatin, saline solution, phosphate buffer, and sterile water. Other active components may optionally be 
added, such as attenuated strains of measles, mumps, and rubella, to produce a polyvalent vaccine 
The suspension may be lyophiltzed, for example, by the gas injection technique. This is performed by 
placing vials of vaccine in a lyophiliztng chamt}er precooled to about ^5*C with 10-18 Pa of dry sterile 
argon, raising the temperature about &-25*C per h to *20*C, conducting a second lyophilizing cycle with 
full vacuum, and then sealing the vials under argon in the usual fashion (see EP 029019761 : Mcaleer et 
al.). For vaccines comprising live herpes virus, the final tyophtlized preparation will preferably contain 
2-8% moisture. 

It is recognized that a number of altemative compositions for active vaccines, not limited to those 
described here in detail, may be efficacious in eliciting specific B- and T-cell immunity. All such 
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compositions are embodied in the spirit of the present invention, providing they include a RFHV/KSHV 
subfamily Glycoprotein B polynucleotide or polypeptide as an active ingredient. 

Vaccines comprising Gtycoprotein B antibodies 

5 

Antibody against Glycoprotein B of the RFHV/KSHV subfamily may be administered by adoptive 
transfer to immediately confer a level of humoral immunity in the treated subject. Passively 
administered anti -glycoprotein B experimentally protects against a lethal challenge with other herpes 
vinjses. even in subjects with compromised T-cell immunity (Eis-Hubinger et al.). 

-10 xhe antibody molecule used should be specific for Glycoprotein B against which protection is 

desired. It should not cross-reactive with other antigens, particularly endogenous antigens of the 
subject to be treated. The antibody may be specific for the entire RFHV/KSHV subfamily (Class II 
antibodies), or for a particular virus species (Class III antibodies), depending on the objective of the 
treatment. Preferably, the antibody will have an overall affinity for a polyvalent antigen of at least about 

15 10' W^: more preferably it will be at least about 10'** M ^ more preferably it wiW be at least about 10" 
M"': even more preferably, it will be 10" M*^ or more. Intact antibody molecules, recombinants, fusion 
proteins, or antibody fragments may be used; however, intact antibody molecules or recombinants able 
to express natural antibody effector functions are preferred. Relevant effector functions include but are 
not limited to virus aggregation; antibody-dependent cellular cytotoxicity; complement activation; and 

20 opsonization. 

Antibody may be prepared according to the description provided in an earlier section. For 
systemic protection, the antibody is preferably monomeric. and preferably of the IgG dass. For 
mucosal protection, the antibody may be polymeric, preferably of the IgA class. The antibody may be 
eitiier monoclonal or polyclonal; typically, a cocktail of monoclonal antibodies is prefen-ed. It is also 
25 prefen-ed that the preparation be substantially pure of other biological components from the original 
antibody source. Other antibody molecules of desired reactivity, and carriers or stabilizers may be 
added after purification. 

In some instances, it is desirable that the antibody resemble as closely as possible an antibody 
of the species which is to be treated. This is to prevent the administered antibody from becoming itself 

30 a target of the recipient's immune response. Antibodies of this type are especially desirable when the 
subject has an active immune system, or when tiie antibodies are to be administered in repeat doses. 

Accordingly, this invention embodies anii-GlycoproteIn B antibody which is human, or which has 
t>een humanized. Polyclonal human antibody may be purified from the sera of human individuals 
previously infected with the respective RFHV/KSHV subfamily herpes viajs. or from volunteers 

35 administered with an active vaccine. Monoclonal human antibody may be produced from the 
lymphocytes of such individuals, obtained, for example, from peripheral blood. In general, human 
hybridomas may be generated according to the methods outlined eariier. Usually, the production of 
stable human hybridomas will require a combination of manipulative techniques, such as both fusion 
with a human myeloma cell line and transformation, for example, with EBV. 

OS 
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In a preferred method, human antibody is produced from a chimeric norvprimate animal with 
functional human immunoglobulin loci (WO 91/10741: Jakobovits et al.). The non-primate animal strain 
(typically a mouse) is incapable of expressing endogenous immunoglobulin heavy chain, and optimally 
at least one endogenous immunoglobulin light chain. The animals are genetically engineered to 
5 express human heavy chain, and optimally also a human tight chain. These animals are immunized 
with a Glycoprotein B of the RFHV/KSHV subfamily of herpes viruses. Their sera can then be used to 
prepare polyclonal antibody, and their lymphocytes can be used to prepare hybridomas in the usual 
fashion. After appropriate selection and purification, the resultant antibody is a human antibody with the 
desired specifidty. 

10 In another preferred method, a monoclonal antibody with the desired specificity for Glycoprotein 

B is first developed in another species, such as a mouse, and then humanized. To humanize the 
antibody, the polynucleotide encoding the specific antibody is isolated, antigen birKling regions are 
obtained, and then recombined with polynucleotides encodirvg elements of a human immunoglobulin of 
unrelated specificity. Alternatively, the nucleotide sequence of the specific antibody is obtained and 

1 5 used to design a related sequence with human characteristics, which can be prepared, for example, by 
chemical synthesis. The heavy chain constant region or the light chain constant region of the specific 
antibody, preferably both, are substituted with the constant regions of a human immunoglobulin of the 
desired class. Preferably, segments of the variable region of both chains outside the complementarity 
determining regions (CDR) are also substituted with their human equivalents (EP 0329400: Winter). 

20 Even more preferably, segments of the variable region are substituted with their human 

equivalents, providing they are not involved either in antigen binding or maintaining the structure of the 
binding site. Important amino acids may be identified, for example, as described by Padlan. In one 
particular technique (WO 94/11509: Couto et al ), a positional consensus sequence is developed using 
sequence and crystallography data of known immunoglobulins. The amino add sequence of the 

25 Glycoprotein B specific antibody is compared with the model sequence, and amino adds involved in 
antigen binding, contact with COR's, or contact with opposing chains are identified. The other amino 
adds are altered, where necessary, to make them conform to a consensus of human immunoglobulin 
sequences. A polynucleotide encoding the humanized sequence is then prepared, transfected into a 
host cell, and used to produce humanized antibody with the same Glycoprotein B specificity as the 

30 originally obtained antibody clone. 

Spedfic antibody obtained using any of these methods is generally sterilized, mixed with a 
pharmaceutically compatible exdpient. Stabilizers such as 0.3 molar glycine, and preservatives such 
as 1:10,000 thimerosal, may also be present. The suspension may be buffered to neutral pH (-7.0), for 
example, by sodium carbonate. The potency may optionally t>e adjusted by the addition of normal 

35 human IgG, obtained from large pools of normal plasma, for example, by the Cohn cold ethanol 
fracti creation procedure. Other diluents, such as albumin solution, may be used as an altemative. The 
concentration is adjusted so that a single dose administration constitutes 0.005-0.2 mg/kg. preferably 
at}out 0.06 mg/kg. A single dose preferably results in a circulating level of a nti- Glycoprotein B, as 
detected by ELISA or other suitable technique, which are comparable to those observed in individuals 
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who have received an active Glycoprotein B vaccine or have recovered from an acute Infection with the 
con-esponding virus, or which are known from experimental woric to be protective against challenges 
with a pathologic dose of virus. 

Administration should generally be perfomied by intramuscular injection, not intravenously, and 
5 care should be taken to assure that the needle is not in a blood vessel. Special care should be taken 
with individuals who have a history of systemic allergic reactions following administration of human 
globulin. For prophylactic applications, the antibody preparation may optionally be administered in 
combination with an active vaccine for Glycoprotein B. as described in the preceding sections. For 
post-exposure applications, the antibody preparation is preferably administered wrthin one week of the 

10 exposure, more preferably within 24 hours, or as soon as possible after the exposure. Subsequent 
doses may optionally be given at approximately 3 month intervals. 

As for all therapeutic instruments described herein, the amount of composition to be used, and 
the appropriate route and schedule of administration, will depend on the clinical status and 
requirements of the particular individual being treated. The choice of a particular regimen is ultimately 

1 5 the responsibility of the prescribing physician or veterinarian. 

The foregoing description provides, inter alia, a detailed explanation of how Glycoprotein B 
encoding regions of herpes vimses. particutariy those of the RFHV/KSHV subfamily, can be identified 
and their sequences obtained. Polynucleotide sequences for encoding regions of Glycoprotein B of 

20 both RFHV and KSHV are provided. 

The polynucleotide sequences listed herein for RFHV and KSHV are believed to be an accurate 
rendition of the sequences contained in the polynucleotides from the herpes vinjses in the tissue 
samples used for this study. They represent a consensus of sequence data obtained from multiple 
clones. However, it is recognized that sequences obtained by amplification methods such as PGR may 

25 comprise occasional errors in the sequence as a result of amplification. The en-or rate is estimated to 
be between about 0.44% and 0.75% for single determinations; about the same rate divided by y){n-^)toT 
the consensus of n different determinations. Nevertheless, the error rate may be as high as 1% or 
more. Sequences free of amplification errors can be obtained by creating a library of herpes virus 
polynucleotide sequences, using oligonucleotides such as those provided in Table 7 to select relevant 

30 clones, and sequencing the DNA in the selected clones. The relevant methodology is wrell known to a 
practitioner of ordinary skill in the art. who may also wish to refer to the description given in the Example 
section that follows. 

It is recognized that allelic variants and escape mutants of herpes viruses occur. 
Polynucleotides and polypeptides may be isolated or derived that incorporate mutations, either naturally 
35 occurring, or accidentally or deliberately induced. wiUiout departing from the spirit of this invention. 

The examples presented below are provided as a further guide to a practitioner of ordinary skill 
in the art. and are not meant to be limiting in any way. 

(^7 
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Examples 

Examp/B 1: OfigonuciBOtlde primers for Herpes Virus Glycoprotein B 

Amino acid sequences of known herpes virus Gtycoprotein 6 molecules were obtained from the 
PIR protein database, or derived from DMA sequences obtained from the GenBank database. The 
sequences were aligned by computer-aided alignment programs and by hand. 

Results are shown in Figure 3. sHVI. bHV4, mHV68, EBV and hHV6 sequences were used to 
identify regions that were relatively well conserved, particularly amongst the gamma herpes viruses. 
Nine regions were chosen for design of amplification primers. The DNA sequences for mese regions 
were then used to design the oligonucleotide primers. The primers were designed to have a 
degenerate segment of 8-14 base pairs at the 3' end, and a consensus segment of 1 8-30 bases at the 
5' end. This provides primers with optimal sensitivity and specificity. 

The degenerate segment extended across highly conserved regions of herpes virus Glycoprotein 
B sequences, encompassing the least numt>er of attemative codons. The primers ccutd therefore t>e 
synthesized with attemative nucleotide residues at the degenerate positions and yield a minimum 
number of combinations. There were no more than 256 alternative forms for each of the primers 
derived. 

The consensus segment was derived from the corresporxling flanking region of the Glycoprotein 
B sequences. Generally, the consensus segment was derived by choosing the most frequently 
occurring nucleotide at each position of all the Glycopmtein B sequences analyzed. However, selection 
was biased in favor of C or G nucleotides, to maximize the ability of the primers to form stable duplexes 

Results are shown In Figures 4-12. and are summarized in Table 4. In a PGR, oligonudeotides 
listed in Table 4 as having a 'sense* orientation would act as primers by hybridizing with the strand 
antisense to the coding strand, and initiating polymerization in the same direction as the Glycoprotein B 
encoding sequence. Oligonucleotides listed in Table 4 as having an 'antisense' orientation would 
hybridize with the coding strand and initiate polymerization in the direction opposite to that of the 
Glycoprotein B encoding sequence. 

Synthetic oligonucleotides according to the designed sequences were ordered and obtained from 
Oligos Etc, Inc. 

Example 2: DNA extraction 

Biopsy specimens were obtained from Kaposi's sarcoma lesions from human subjects 
diagnosed with AIDS. The specimens were fixed in parafomnaldehyde and embedded in paraffin, which 
were processed for normal histological examination. 

Fragments of the paraffin samples were extracted with 500 nL of xylene in a 1.5 mL 
EPPENDORF"* conical centrifuge tube. The samples were rocked gently for 5 min at room 
temperature, and the tubes were centrifuged in an EPPENDORF^** bench-top centrifuge at 14,000 rpm 
for 5 min. After removing the xylene with a Pasteur pipette. 500 \iL of 95% ethanol was added, the 
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sampte was resuspended, and then re-centrifuged. The ethanol was removed, and the wash step was 
repeated. Samples were then air-dried for about 1 hour. 500 ^L of proteinase-K buffer (0.5% TWEEN™ 
20, a detergent; 50 mM Tris buffer pH 7.5. 50 mlV! NaCI) and 5 tiL of proteinase K (20 mg/mL) were 
added, and the sample was incubated for 3 h at 55*C. The proteinase K was inactivated by incubating 
5 at 95*C for 10 min. 

Samples of DNA from KS tissue were pooled to provide a consistent source of polynucleotide for 
the amplification reactions. This pool was known to contain DNA from KSHV. as detected by 
amplification of KSHV DNA polymerase sequences, as described in commonly owned U.S. Patent 
Application 60/001 ,148. 

10 

Example 3: Obtaining amplified segments of KSHV Giycoprotein B 

The oligonucleotides obtained in Example 1 were used to amplify segments of the DNA 
extracted from KSHV tissue in Example 2. according to the following protocol. 

15 A first PCR reaction was conducted using 2 \iL of pooled DNA template. 1 ^iL of oligonucleotide 

FRFDA (50 pmol/jiL), 1 jiL of oligonucleotide TVNCB (50 pmoU^^L). 10 jiL of lOx buffer, 1 ^iL containing 
2.5 mM of each of the deoxyribonudeotide triphosphates (dNTPs). 65 tiL distilled water, and 65 nL 
mineral oil. The mixture was heated to BO'C in a Pericin-Elmer (model 480) PCR machine. 0.5 mU Taq 
polymerase (BRL, 5 U/nL) and 19.5 ul water was then added. 35 cycles of amplification were 

20 conducted in the following sequence: 1 min at 94"C, 1 min at the annealing temperature, and 1 min at 
72'C. The annealing temperature was BO'C in the first cycle, and decreased by 2*C each cyde until 
50'C was reached. The remaining cycles were perfomied using 50*C as the annealing temperature. 

A second PCR reaction was conducted as follows: to 1 nL of the reaction mixture from the 
previous step was added 1 olrgonudeotide NIVPA (50 pmoUnL). 1 oligonucleotide TVNCB (50 

25 pmol/»iL). 10 jiL of lOx buffer. 1 mL dNTPs. 66 jiL water, and 65 mL mineral oil. The mixture was heated 
to 80'C. and 0.5 nL Taq polymerase in 19.5 ^iL water was added. 35 cycles of amplification were 
conducted using the same temperature step-down procedure as before. The PCR product was 
analyzed by electrophoresing on a 2% agarose gel and staining with ethidium bromide. 

The two-round amplification procedure was perfomied using fourteen test buffers. Five buffers 

30 yielded PCR product of about the size predicted by analogy with other herpes sequences. These 
induded WB4 buffer {10x WB4 buffer is 0.67 M Tris buffer pH 8.8. 40 mM MgClj. 0.16 IVI (NH4)2S04. 0.1 
M ^-mercaptoethanol, 1 mg/mL bovine senjm albumin, which is diluted 1 to 10 in the reaction). Also 
tested was WB2 buffer (the same as WB4 buffer, except with 20 mM MgClj in the lOx concentrate). 
Also tested were buffers that contained 10 mM Tris pH 8.3, 3.5 mM MgClj and 25 mM KCI; or 10 mM 

35 Tris pH 8.3. 3.5 mM MbClj and 75 mM KCI; or 10 mM Tris pH 8.8. 3.5 mM MgClj and 75 mM KCI; when 
diluted to final reaction volume. The WB4 buffer showed the strongest band, and some additional 
fainter bands. This may have been due to a greater overall amount of labeled amplified polynudeotide 
in the WB4 sample. 

The product from amplification with WB2 buffer was selected for further investigation. A third 
40 round of amplification was performed to introduce a radiolabel. The last-used oligonudeotide frVNCB) 
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is end-labeled with gamma "P-ATP. and 1 pL was added to 20 ^^L of the reaction mixture from the 
previous amplification step, along with 1 pL 2.5 mM dNTTP. The mixture was heated to 80'C, and 0.5 pL 
Taq polymerase was added. Amplification was conducted through five cydes of 94'C. 60'C and 72'C. 
The reaction was stopped using 8.8 jiL of loading buffer from a Circumvent sequencing gel kit. 
5 A III aliquot of the radiolabeled reaction product was electro phoresed on a 6% 

polyacrylamide sequencing gel for 1.5 h at 5 VC. The gel was dried for 1.5 h, and an autorad to graph 
was generated by exposure for 12 h. Two bands were identified. The larger band had the size 
expected for the fragment from analogy with other gamma herpes virus sequences. 

The larger band was marked and cut out. and DNA was eluted by incubation in 40 TE buffer 
10 (10 mM Tris and 1 mM EDTA. pH. 8.0). A further amplification reaction was performed on the extracted 
DNA. using 1 jiL of the eluate. 10 uL 10x WB2 butter. 1 \il 2.5 mM dNTP. 1 ^iL of each of the second 
set of oligonucleotide primers (NIVPA and TVNCB), and 65 jiL water. The mixture was heated to 80'C. 
and 0.5 Taq polymerase in 19.5 ^iL water was added. Amplification was conducted through 35 
cycles, using the temperature step-down procedure described earlier. 

15 

Example 4: Sequence of the 386 base fragment of KSHV Glycoprotein B 

The amplified polynucleotide fragment from the Glycoprotein B gene of KSHV was purified and 
cloned according to the following procedure. 

20 40 mL of amplification product was njn on a 2% agarose gel. and stained using 0.125 pg/mL 

ethydium bromide. The single band at about 400 base pairs was cut out. and purified using a 
QIAGEN'** II gel extraction kit. according to manufacturer's instructions. 

The purified PGR product was ligated into the pGEM^**-! cloning vector. The vector was used to 
transform competent bacteria (E. coti JM-109). Bacterial clones containing the amplified DNA were 

25 picked and cultured. The bacteria were lysed and the DNA was extracted using phenol-chloroform 
followed by precipitation with ethanol. Colonies containing inserts of the correct size were used to 
obtain DNA for sequencing. The clone inserts were sequenced from t»th ends using vector-specific 
oligonudeotides (fon<vard and reverse primers) with a SEQUENASE^ 7-dea2a dGTP kit, according to 
manufacturer's directions. A consensus sequence for the new fragment was obtained by combining 

30 sequence data obtained from 5 clones of one KSHV Glycoprotein B amplification product. 

The length of the fragment in between the primer hybridizing regions was 319 base pairs. The 
nucleotide sequence is listed as SEQ. ID N0:3 and shown in Figure 1. The encoded polypeptide 
sequence is listed as SEQ. ID NO:4. 

Figure 1 3 compares the sequence of this Glycoprotein B gene fragment with the con-esponding 

35 sequence of other gamma herpes viruses. Single dots (.) indicate residues in other gamma herpes 
viruses that are identical to those of the KSHV sequence. Dashes (-) indicate positions where gaps 
have been added to provide optimal alignment of the encoded protein. The lor>gest stretch of 
consecutive nudeotides that is identical between the KSHV sequence and any of the other listed 
sequences is 14. Short conserved sequences are scattered throughout the fragment. Overall, the 
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polynucleotide fragment is 63% identical between KSHV and the two closest herpes virus sequences. 
sHVI and bHV4. 

The sequence data was used to design Type 3 oligonucleotide primers of 20-40 base pairs in 
length. The primers were designed to hybridize preferentially wth the KSHV Glycoprotein B 
5 polynucleotide, but not with other sequenced polynucleotides encoding Glycoprotein B. Example 
primers of this type were listed earlier in Table 7. 

Figure 14 compares the predicted amino acid sequence encoded by the same Glycoprotein B 
gene fragment. At the amino acid level, two short segments are shared between KSHV and a 
previously known gamma herpes virus, bHV4. The first (SEQ. ID NO:B4) is 13 amino acids in length 
10 and located near the N-terminal end of the fragment. Ttie second (SEQ. ID NO;65) is 15 amino acids in 
length and located near the C-terminal end of the fragment. All other segments shared twtween KSHV 
and other gamma herpes vinjses are 9 amino acids or shorter. 



15 



Example 5: Sequence of the 386 base fragment of RFHV Glycoprotein B 



Tissue specimens were obtained from the tumor of a Macaque nemesthna monkey at the 
University of Washington Regional Primate Research Center. The specimens were fixed in 
paraformaldehyde and eml>edded in paraffin. DNA was extracted from the specimens according to the 
procedure of Example 2. 

20 The presence of RFHV polynucleotide in DNA preparations was determined by conducting PCR 

amplification reactions using oligonucleotide primers hybridizing to the DNA polymerase gene. Details 
of this procedure are provided in commonly owned U.S. Patent Application 60/001,148. DNA extracts 
containing RFHV polynucleotide determined in this fashion were pooled for use in the present study. 

DNA preparations containing RFHV polynucleotide served as the template in PCR amplification 

25 reactions using Glycoprotein B consensus-degenerate oligonucleotides FRFDA and TVNCB. followed 
by a second round of amplification using oligonucleotides NIVPA and TVNCB. Conditions were 
essentially the same as in Example 3, except that only WB4 buffer produced bands of substantial 
intensity, with the amount of DNA in the initial source and the conditions used. Labeling of the amplified 
DNA was performed with "P end-labeled NIVPA. as before: the product was electrophoresed on a 6% 

30 polyacrylamide gel, and an autoradiogram was obtained. A ladder of bands corresponding to about 386 
base pairs and about 10 higher mol wt concatemers was observed. The 386 base pair band (with the 
same mobility as a simultaneously njn KSHV fragment) was cut out of the gel and extracted. 

To detemnine whether the DNA in this extract was obtained from a specific amplification reaction. 
PCR s were set up using NIVPASQ alone, TVNCBSQ alone, or the two primers together. Buffer 

35 conditions were the same as for the initial amplification reactions. The mixture was heated to 80'C. Taq 
polymerase was added, and the amplification was carried through 35 cycles using the temperature 
step-down procedure. Theoretically, specific amplification reactions accumulate product linearly when 
one primer is used, and exponentially when using two primers with opposite orientation. Thus, 
specificity is indicated by more product in the reaction using both primers, whereas equal product in all 

40 three mixtures suggests non-specific amplification. Am plifi cation products from these test reactions 

7/ 
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were analyzed on an agarose gel stained with ethidium bromide. The RF extract showed no product for 
the NIVPASQ reaction, a moderate staining band for the TVNCBSQ reaction at the appropriate mobility, 
and an intensely staining band for both pnmers together. For a KSHV fragment assayed in parallel, 
there was a faint band for the NIVPASQ reaction, no band for the TVNCBSQ reaction, and an intensely 
5 staining band for both primers together. We concluded that the 386 base pair band in the RF extract 
represented specific amplification product. 

Accordingly, 40 \iL of the RF extract that had been amplified with both primers was mn 
preparatively on a 2% agarose gel. and the -386 base pair band was cut out. Agarose was removed 
using a QIAGEN™ kit, and the product was cloned in E. coli and sequenced as in Example 4. A 
10 consensus sequence was determined for 3 different clones obtained from the same amplified RFHV 
product. 

The polynucleotide sequence of RFHV Glycoprotein B fragment (SEQ. ID NO:1) is aligned in 
Figure 1 with the corresFwnding sequence from KSHV. Also shown is the encoded RFHV amino acid 
sequence (SEQ. ID NO:2). Between the primer hybridization regions (nucleotides 36-354). the 
15 polynucleotide sequences are 76% identical; and the amino acid sequences are 91% identical. The 
internal cysteine residue and the potential N-iinked glycosyiation site are both conserved between the 
two viruses. 

The sequence data was used to design Type 3 oligonucleotide primers of 20-40 base pairs in 
length. The primers were designed to hybridize preferentially with the RFHV Glycoprotein B 
20 polynucleotide, but not with other sequenced polynucleotides encoding Glycoprotein B. Example 
primers of this type were listed eariier in Tabie 7. 

Figure 1 5 compares the predicted amino acid sequence encoded by nucleotides 36-354 of the 
Glycoprotein B gene fragment. As for the KSHV sequence, two short segments are shared between 
RFHV and a previously known gamma herpes vinjs. bHV4 All other segments shared between RFHV 
25 and other gamma herpes viruses are shorter than 9 amino acids in length. 

Figure 16 is an alignment of sequence data for the same Glycoprotein B fragment in the 
spectrum of herpes viruses for which data is available. Figure 17 shows the phylogenetic relationship 
between herpes vinjses. based on the degree of identity across the partial Glycoprotein B amino acid 
sequences shown in Figure 16. By amino acid homology, amongst the viruses shown, RFHV and 
30 KSHV are most closely related to bHV4. eHV2, and sHVI 



Example 6; Oligonucleotide primers and probes for the RFHV/KSHV subfamily 

Based on the polynucleotide fragment obtained for RFHV and KSHV, seven Type 2 
35 oligonucleotides were designed that could be used either as PGR primers or as hybridization probes 
vwth members of the RFHV/KSHV subfamily. 

Four consensus-degenerate Type 2 oligonucleotides. SHMDA. CFSSB. ENTFA. and DNIQB are 
shown in Figure 17, alongside the sequences they were derived fi-om. Like the oligonucleotides of 
Example 1, they have a consensus segment towards the 5' end, and a degenerate segment towards 
40 the 3" end. However, these oligonucleotides are based only on the RFHV and KSHV sequences, and 
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will therefore preferentially form stable duplexes with Glycoprotein B of the RFHV/KSHV subfamily. A 
list of exemplary Type 2 oligonucleotides was provided earlier in Table 6. 

Different Type 2 oligonucleotides have sense or antisense orientations. Primers with opposing 
orientations may be used together in PCR amplifications. Artematively. any Type 2 oligonudeotide may 
5 be used in combination with a Type ^ oligonucleotide with an opposite orientation. 

Example 7: Upstream and downstream Glycoprotein B sequence 

Further amplification reactions are conducted to obtain additional sequence data. The source for 
10 KSHV DMA is Kaposi's Sarcoma tissue, either frozen tissue blocks or paraffin-embedded tissue, 
prepared according to Example 2, or cell lines developed from a cancer with a KSHV etiology, such as 
body cavity lymphoma. Also suitable is KSHV that is propagated in culture (Weiss et al.) 

The general strategy to obtain further sequence data in the 5' direction of the coding strand is to 
conduct amplification reactions using the consensus-degenerate (Type 1) oligonucleotide hybridizing 
15 upstream from the fragment as the 5' primer, in combination with the closest virus-spedfic (Type 3) 
oligonudeotides as the 3' primers. Thus, a first series of amplification cycles are conducted, for 
example, using FRFDA and TNKYB as the first set of primers. This may optionally be followed by a 
second series of amplification cydes. conducted, for example, using FRFDA and GLTEB as a second 
set of primers. 

20 The conditions used are similar to those described in Examples 3 and 4. The reaction is 

performed in WB4 buffer, using the temperature step-down procedure described in Example 3. After 
two rounds of amplificaUon. the product is labeled using the last-used vinjs-spedfic oligonudeotide 
(GLTEB. in this case), end-tabeled with gamma "P-ATP. The labeled product is electrophoresed on 
6% polyacrylamide, and a band conesponding to the appropriate size as predicted by analogy with 

25 other herpes vimses Is exdsed. After re-amplification, the produd is purified, doned. and sequenced 
as before. A consensus sequence for the new fragment Is obtained by combining results of about three 
determinations. 

In order to obtain further sequence data in the 3' direction of the coding strand, amplifications are 
conducted using consensus-degenerate (Type 1) oligonudeotides hybridizing downstream from the 
30 fragment as the 3* primer, in combination with the dosesi vinjs-spedfic (Type 3) oligonucleoUdes as the 
5' primers. In one example, a first series of amplification cycles are conducted using NVFDB and 
TVFLA. optionally followed by a second series conducted using NVFDB and SQPVA. Amplification and 
sequendng Is performed as before. The new sequence is used to design further Type 3 
oligonudeotides with a sense orientation, which are used with other downsUeam-hybridizing Type 1 
35 oligonudeotides (such as FREYB and KfVFDB) to obtain further sequence data. Alternatively, further 
sequence data in the 3' direction is obtained using Type 1 oligonudeotides with opposite orientation: 
for example, two primers are selected from the group of FRFDA. NIVPA. TVNCA. NIDFB, NVFDB. and 
FREYB: additional primers may bo selected for nested amplification. 

To obtain sequence data 3' from the most downstream otigonudeoUde primer. Type 1 primers 
40 such as CYSRA. or Type 3 primers such as TVFLA. may be used in combination with primers 

^7.3 
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hybridizing towards the 5' end of the DNA polymerase gene. Oligonucleotide primers hybridizing to the 
DMA polymerase gene of herpes viruses related to RFHV and KSHV are described in commonly owned 
U.S. Patent Application 60/001,148. The DNA polymerase encoding region Is located 3' to the 
Glycoprotein B encoding region. PCRs conducted using this primer combination are expected to 
5 amplify polynucleobdes comprising the 3' end of the Glycoprotein B encoding region, any intervening 
sequence if present, and the 5' end of the DNA Polymerase encoding region. 
This strategy was implemented as follows: 

DNA containing KSHV encoding sequences for Glycoprotein B was prepared from a frozen 
Kaposi's sarcoma sample, designated RiGr, and a cell tine derived from a body cavity lymphoma. 
10 designated BC~1. 

In order to obtain the full 5' sequence, a Type 1 oligonucleotide probe was designed for the 
encoding sequence suspected of being upstream of Glycoprotein B: namely, the capsid maturation 
gene (CAPMAT). Known sequences of CAPMAT from other viruses were used to identify a relatively 
conserved region, and design a consensus-degenerate primer designated FENSA to hybridize with 
15 CAPMAT in the sense orientation of Glycoprotein B. A Type 1 oligonucleotide probe was designed for 
the encoding sequence suspected of t>eing downstream of Glycoprotein B: namely, the DNA 
polymerase. These oligonucleotides are listed in Table 9: 



TABLE 9: Additional Type 1 OligonucleotldM used for Detecting, Ampltfytng, or Characterizing 

Herpes Virus Potynuclsotldes 



1 Target: C*psfd/¥aturmtion gene from Herpes Vimses. especially from gamma Herpesviruses 


Designation 


Sequence 
(5' to 30 


Length 


No. of 
forms 


Orien-tation 


SEQ 
ID: 


FENSAC 


GCCTTTGAGAATTCYAARTAYATHAAR 


27 


48 


sense 


77 


FENSAG 


GGGTTTGAGAATTCYAARTAYATHAAR 


27 


48 


sense 


78 


Target DNA potymaras9 gmne from Herpesviruses, especJslly from gamms Herpes V)n«ses 


Desigruition 


Sequence 
{5' to 30 


Length 


No. of 
forms 


Ohen-tation 


SEQ 
ID: 


CVNVB 


TAAAAGTACAGCTCCTGCCCGAANACRTTN 
ACRCA 


35 


64 


anttsense 


79 



20 Amplification was carried out using parrs of sense and antisense primers that covered the entire 

Glycoprotein B encoding region. Fragments obtained include those listed in Table 10. 
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TABLE 10: KSHV Glycoprotein B fragments otitatned 




Fragment 


Length 


Poettion 


1 


NIVPA TVNCB 


0.39 kb 


original fragment 


2 


FENSA VNVNB 


O.g kb 


5' of fragnrwnt 1 across to CAPMAT j 


3 


-rVNCA-^FREYB 


2.3 kb 


3' of fragment 1' | 


4 


FAYDA -+ FREYB 


0.65 kb 


3' of fragment 1 


5 


SQPVA HVLQB 


2.5 kb 


3' of fragment 1 across to DNA polymerase 


6 


FREYA SCGFB 


1.1 kb 


3' of fragment 2 across to DNA polymerase 



The protocol used for amplitying and sequencing was as follows: PGR amplification was carried 
out using the DNA template wrth the primer pair (e.g.. FREYA and SCGFB). 35 cycles were conducted 
of 94*C for 45 sec. 60«C for 45 sec. and 72*'C for 45 sec; and then followed by a final extension step at 
5 72*C for 10 min, PGR products of the predicted length were purified on agarose gels using the 
QIAQUICK™ PGR purification krt from Quiagen. Purified PGR products were reamplified in a second 
round of amplification. The second round was conducted alternatively in a nested or non-nested 
fashion. In the example given, second-round amplification was conducted using FREYA and SGGFB, 
or with FREYA and HVLQB. Amplification for 35 cycles was conducted at 94'C for 45 sec, 65*C for 45 

10 sec. and 72'C for 45 sec; and then followed by a final extension step at 72*'C for 60 min. 

The PGR products were ligated Into the Novagen PT7 BLUE™ vector, and transfonned into 
Novablue competent E. coli. Ugations and transfomiations were performed using Novagen protocols. 
Colonies were screened by PGR using Ml 3 fonvard and reverse oligonucleotides. Using the Quiaquick 
piasmid isolation kit, plasmids were isolated from PGR positive colonies that had been grown up 

1 5 overnight in 5 mL LB broth at 37*G. Manual sequencing of the plasmids using Ml 3 fonvard and reverse 
sequencing primers was perfomied following the USB Sequenase Kit protocol (USB). Automated 
sequencing was performed by ABI methods. 

Additional KSHV-specific Type 3 oligonucleotides were designed as the KSHV sequence 
emerged. Type 3 oligonucleotides were used in various pair combinations or with Type 1 

20 oligonucleotides to PGR amplify, clone, and sequence sections of the KSHV DNA. The Type 3 
oligonucleotides used are listed in Table 1 1 : 
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TABLE 11: Addfttonal Type 3 Oligonucleotides used for Detecting, Ampttfylng, or 
Charactertzing Herpes Virus Poty nucleotides encoding Gtycoproteln B 


Target: Glycoprot^n B from KSHV 


Desig- 
nation 


Sequence 

<5' to 3') 


Length 


No. of 
forms 


Orientation 


SEQ 
10: 


GAYTA 


TGTGGAAACGGGAGCGTACAC 


21 


1 


sense 


80 1 


DTYSB 


TCAGACAAGAGTACGTGTCGG 


21 


1 


anti-sense 


81 1 


AIYGB 


TACAGGTCGACCGTAGATGGC 


21 


1 


anti-sense 


82 1 


VTECA 


CGCCATTTCCGTGACCGAGTG 


21 


1 


sense 


83 


CEHYB 


TGATGAAGTAGTGTTCGCAGG 


21 


1 


anti-sense 


84 


DLGGB 


GATGCCACCCAGGTCCGCCAC 


21 


1 


anthsense 


85 


DLGGA 


GTGGCGGACCTGGGTGGCATC 


21 


1 


sense 


86 


RAPPA 


CGTAGATCGCAGGGCACCTCC 


21 


1 


sense 


87 


Target DNA PotymvnsB from KSHV 


Desig- 
nation 


Sequence 

<5' to 3') 


Length 


No. of 
forms 


Orientation 


SEQ 

iO: 


GEVFB 


GTCTCTCCCGCGAATACTTCT 


21 


1 


antisense 


88 


HVLQB 


GAGGGCCTGCTGGAGGACGTG 


21 


1 


antisense 


69 


SCGFB 


CGGTGGAGAAGCCGCAGGATG 


21 


1 


antisense 


90 



Figure 18 is a map showing the location where oiigonucJeotides hybridize with the KSHV DNA. 
Abbreviations used are as follows; d or h = consensus-degenerate probes that hybridize with 
herpesvirus sequences (Type 1). sq = additional sequencing tail available, g = prot>es that hybridize 
5 with gamma herpesviruses (Type 1), f - probes that hybridize with KSHV/RFHV family of herpesviruses 
(Type 2). ks = probes specific for KSHV (Type 3). 

Figure 1 9 lists a consensus sequence obtained by compiling sequence data from each of the 
characterized fragments. The polynucleotide sequence (SEQ. ID NO:91) is shown. Nucleotides 1-3055 
(SEQ. ID NO;92) incorporating the region before the DNA polymerase encoding sequence is an 
10 embodiment of this invention. This consensus sequence represents the consensus of data obtained 
from tKJth the Kaposi's sarcoma sample RiGr. and the lymphoma cell line BC-1. with a plurality of 
clones t>eing sequenced for each sample and each gene segment Between about 3-9 determinations 
have been performed at each location. 

Also shown in Figure 19 is the amino acid translation of the three open reading frames (SEQ. ID 
15 NOS;93-95). The encoded CAPMAT protein fragment (SEQ. ID NO:93) overtaps the 5' end of the 
Glycoprotein B encoding sequence (SEQ. ID NO:94) in a different phase. Further upstream, the 
CAPMAT encoding sequence is also suspected of comprising control elements for Glycoprotein B 
transcription, due to homology with the binding site for RNA polymerase 2 of Epstein Barr Virus. This 
putative promoter region is underiined in the Figure. At the 3" end of the Glycoprotein B encoding 
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sequence, there is an untranslated sequence including a polyadenlyation signal. Further downstream is 
the encoding sequence for a DNA Polymerase fragment (SEQ. ID NO:95). 

When the Glycoprotein B encoding sequence was compared with other sequences on GenBank. 
homology was found only with Glycoprotein B sequences from other herpes viruses. Occasional 
5 sequences of 20 nucleotides or less are shared with several herpes viruses. The sequence 
ATGTTCAGGGAGTACAACTACTACAC (SEQ. ID NO:98) is shared with eHV2. Other than this 
sequence, segments of the KSHV encoding region 21 nucleotides or longer are apparently unique, 
compared with other previously known sequences. 

Within the Glycoprotein B encoding sequence, four allelic variants were noted at the 
10 polynucleotide level between sequence data obtained using the Kaposi's sarcoma sample and that 
obtained using the body cavity lymphoma cell line. These are indicated in the Figure by ant>ws. All but 
one of the variants was silent. The fourth variant causes a difference of Proline to Leucine in the gene 
product. 

The protein pnjduct encoded by the KSHV Glycoprotein B gene has the following features: 
15 There is a domain at the N-terminus that corresponds to the signal-peptide domain (the "leader-) of 
Glycoprotein B other herpes viruses. The complete KSHV Glycoprotein B amino acid sequence with 
that known for other herpes viruses is provided in Figure 3. and reveals areas of homology. Residues 
highly conserved amongst herpes virus Glycoprotein B sequences are marked with an asterisk (*). The 
cysteine residues conserved amongst other herpes virus Glycoprotein B sequences are also present in 
20 that of KSHV. In addition, there are two additional cysteines which could form an additional internal 
disulfide and stabilize the three-dimensional structure (marked by ■•'). The KSHV Glycoprotein B 
sequence also has a predicted membrane-spanning domain that con-esponds to that on Glycoprotein B 
of other herpes viruses. 

The full glycoprotein 6 sequence of RFHV is obtained by a similar strategy. The source for 
25 RFHV DNA is similarty prepared tissue from infected monkeys at the University of Washington Regional 
Primate Research Center. DNA is exti^cted as described in Example 5 

In order to obtain further sequence data in the 5' direction of the coding strand, amplifications are 
conducted using the consensus-degenerate CType 1) oligonucleotide hybridizing upstream from the 
fragment as the 5' primer, in combination w\ih the closest virus-specific (Type 3) oligonucleotides as the 
30 3' primers. Thus, a first series of amplification cycles are conducted, for example, using FRFDA and 
AAITB as the first set of primers. This is followed by a second series of amplification cycles, conducted 
the same primers, or using the nested set FRFDA and GMTEB. Amplification conditions are similar to 

those described for KSHV. 

In order to obtain further sequence data in the 3' direction of the coding sti^nd, amplifications are 
35 conducted using consensus-degenerate (Type 1) oligonucleotides hybridizing downstream from the 
fragment as the 3' primer, in combination with the closest virus-specific (Type 3) oligonucleotides as the 
5" primers. Thus, a first series of amplification cycles are conducted using NVFDB and VEGLA. 
followed by a second series conducted using NVFDB and PVLYA. Amplification and sequencing is 
performed as before. The new sequence is used to design further Type 3 oligonucleotides with a sense 

77 



wo 97/12042 



PCT/US96/I5702 



orientation, which are used with other downstream-hybridizing Type 1 oligonucleotides (namely FREYB 
and NIVFDB) to obtain further sequence data. 

Polynucleotide and amino acid sequence data is used to compare the Glycoprotein B of RFHV 
and KSHV with each other, and with that of other herpes viruses. The RFHV and KSHV sequences 
5 may t>e used to design further subfamily- specific Type 2 oligonucleotides, as in Example 6. 

Exampie 8: Glycoprotein B sequences from DNA libraries 

Complete Glycoprotein B sequences can be obtained or confirmed by generating DNA libraries 

1 0 from affected tissue. Sources of DNA for this study are the same as for Example 7. 

The DNA tysate is digested with proteinase K, and DNA is extracted using phenoi-chloroform. 
After extensive dialysis, the preparation is partially digested with the Sau3A I restriction endonudease. 
The digest is centrifuged on a sucrose gradient, and fragments of about 10-23 kilobases are recovered. 
The lambda DASH-2™ vector phage (Stratagene) is prepared by cutting with BamHI. The 

15 size-selected fragments are then mixed with the vector and llgated using DNA tigase. 

The ligated vector is prepared with the packaging extract from Stratagene according to 
manufacturer's directions. It is used to infect XLI-BLUE^" MRA bacteria. About 200.000 of the 
phag^infected bacteria are plated onto agar at a density of about 20,000 per plate. After culturing, the 
plates are overtaid with nitrocellulose, and the nitrocellulose is cut into fragments. Phage are eluted 

20 from the fragments and their DNA are subjected to an amplification reaction using appropriate 
virus-specific primers. The reaction products are run on an agarose gel. and stained with ethidium 
bromide. Phage are recovered from regions of the plate giving amplified DNA of the expected size. 
The recovered phage are used to infect new XL1 bacteria and re-plated in fresh cultures. The process 
is repeated until single clones are obtained at limiting dilution. 

25 Each clone selected by this procedure is then mapped using restriction nucleases to ascertain 

the size of trie fragment incorporated. Inserts sufftciently large to incorporate the entire Glycoprotein B 
sequence are sequenced at both ends using vector- specific primers Sequences are compared with 
the known polynucleotide sequence of the entire EBV genome to determine whether the fragment 
spans the intact Glycoprotein B sequence. DNA is obtained from suitable clones, sheared, and 

30 sequenced by shot-gun cloning according to standard techniques. 

Example 9: Antigenic regions of Glycoprotein B 

The polynucleotide fragments between the hybridization sites for NIVPA and TVNCB in the 
35 Glycoprotein B gene have the predicted amino acid sequences shown in Figure 14. Based on these 
sequences, peptides that are unique for RFHV or KSHV. or that are shared between species can be 
identified. 

Figure 14 shows example peptides of 6 or 7 amino acids in length. Some of the peptides 
comprise one or more residues that are distinct either for RFHV or KSHV (Class III), or for the 
40 RFHV/KSHV subfamily (Class 11) compared with the corresponding gamma herpes virus peptides. 
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To confirm that regions contained within this lOe-amino acid region of Glycoprotein B may be 
recognized by antibody, computer analysis was perfonned to generate Hopp and Woods antigenicity 
plots. The Hopp and Woods detennination is based in part on the relative hydrophilicity and 
hydrophobicity of consecutive amino acid residues (Hopp et al). Results are shown in Figures 20. 
5 21 and 22. Key: - = antigenic; * = hydrophobic: # = potential N-linked glycosylation site. Figure 20 
shows the analysis of the 106 amino acid Glycoprotein B fragment from RFHV; Figure 21 shows the 
analysis of the KSHV fragment, and Figure 22 shows the analysis of the full KSHV sequence. 

Both RFHV and KSHV contain several regions predicted to be likely antibody target sites. In 
particular, the KSHV sequence shows an antigenic region near the N-tenninat end of this fragment, and 
10 near the potential N-linked glycosylation site. The full-length KSHV sequence shows hydrophobic 
minima corresponding both to the signal peptide (residue -25) and the transmembrane domain (residue 
-750). A number of putative antigenic regions with scores >1.0 or >1.5 are observed. Particularly 
ratable is a region scoring up to -2.5 that appeare at about residues 440-460. 

15 ExamptB 10: Vims specific Glycoprotein B amplification assays 

Type 3 oligonucleotides are used in nested vinjs-specific amplifrcation reactions to detect the 
presence of RFHV or KSHV in a panel of tissue samples from potentially affected subjects. 

For KSHV. DNA is extracted from tissue suspected of hartwring the virus: partcularty biopsy 

20 samples from human subjects v«th Kaposi's Sarcoma lesions and body cavity B-cell lymphoma. A 
number of different tissue samples are used, including some from KS lesions, some from apparently 
unaffeaed Ussue in the same individuals, some from HIV positive individuals with no apparent KS 
lesions, and some from HIV negative individuals. Five samples are obtained in each category. DNA is 
prepared as described in Example 2. 

25 The oligonucleotide primers GLTEA. YELPA. VNVNB. and ENTFB are ordered from Oiigos Etc.. 

Inc. The DNA is amplified in two stages, using primers GLTEA and EmTB in the first stage, and 
YELPA and VNVNB in the second stage. The conditions of the amplification are similar to those of 
Example 3. The reaction product is electrophoresed on a 2% agarose gel. stained with ethidium 
bromide, and examined under U.V. light. A positive result is indicated by the presence of abundant 

30 polynudeoUde in the reaction product, as detected by ethidium bromide staining. This reflects the 
presence of KSHV derived DNA in the sample; specifically, the Glycoprotein B encoding fragment from 
YELPA to VNVNB. Results are matched with patient history and sample histopathology to detemnine 
whether positive assay results conelate with susceptibility to KS. 

For RFHV. DNA is extracted from frozen tissue samples taken from Macaca nemesfrina and 

35 Macaca fascicularis monkeys living in the primate colony at the Washington Regional Primate 
Research Center. Ten samples are taken each from tissue sites showing overt symptoms of 
fibromatosis, apparently unaffected sites in the same monkeys, and con^sponding sites in monkeys 
showing no symptomatology. Nested PGR amplification is conducted first using GMTEA and VEGLB. 
then using KYEIA and TDRDB. Amplification product is electrophoresed and stained as before, to 

40 detemiine whether RFHV polynucleotide is present in the samples. 
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Bxampl0 11: immunogenic regions of Glycoprotein B 

To identify what antibodies may be generated during the natural course of infection with KSHV, 
5 serum samples are obtained from 10-20 AIDS subjects wtth Kaposi's Sarcoma lesions, from 10-20 
HIV-positive symptom-negative subjects, and 10-20 HIV-negative controls. In initial studies, sera in 
each population are pooled for antibody analysis. 

Peptides 12 residues long are synthesized according to the entire predicted extracellular domain 
of the mature KShfV Glycoprotein B molecule. Sequential peptides are prepared covering the entire 

1 0 sequence, and overlapping by 8 residues. The peptides are prepared on a nylon membrane support by 
standard F-Moc chemistry, using a SPOTS™ kit from Genosys according to manufacturer's directions. 
Prepared membranes are overlaid with the serum, washed, and overlaid with beta-galactose 
conjugated anti-human IgG. The test is developed by adding the substrate X-gal Positive staining 
indicates IgG antibody reactivity in the senjm against the corresponding peptide. 

15 Similarly, to identify antibodies formed in the natural course of RFHV infection, blood samples 

are collected from 10 Macaca nemestrina and 10 Macaca fascicularis monkeys, a proportion of which 
display overt symptoms of fibromatosis. The presence or absence of an ongoing RFHV infection is 
confirmed by conducting PCR amplification assays using RFHV-specific oligonucleotides as in Example 
1 0. Plasma and blood cells are separated by centrifugation. These sera are used to test for antibodies 

20 in a method similar to that for KSHV. except that 12-mers are synthesized based on the RFHV 
Glycoprotein B sequence. 

Select RFHV and KSHV peptides are also tested in animal models to detemiine immunogentcity 
when administered in combination with desirable adjuvants such as alum and DETOX^**. Suitable 
peptides include those identified in the aforementioned experiment as eliciting antibody during the 

25 natural course of viral infection. Other candidates include those believed to participate in a biological 
function of Glycoprotein B, and those corresponding to peptides of other herpes viruses known to eiicit 
viral neutralizing antibodies. The peptides are coupled onto keyhole limpet hemocyanin (KLH) as a 
carrier, combined with adjuvant according to standard protocols, and 100 fig peptide equivalent in 1-2 
mL inoculum is injected intramuscularty into rabbits. The animals are boosted with a second dose 4 

30 weeks later, and test-bled after a further 2 weeks. 

Microtiter plate wells are prepared for ELISA by coating with the immunogen or unrelated 
peptide-KLH control. The wells are overlaid with serial dilutions of the plasma from the test bleeds, 
washed, and developed using beta-galactose anti-human tgG and X-gal. Positive staining in the test 
wells but not the control wells indicates that the peptide is immunogenic under the conditions used. 

35 

Example 12: tdentificaVon and characterization of Glycoprotein B from other members of 
the RFHV/KSHV subfamily 

Tissue samples suspected of containing a previously undescrit>ed gamma herpes virus. 
40 particularly fibroproliferative conditions, lymphocyte malignancies, and conditions associated with 
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immunodeficiency and immunosuppression, such as acute respiratory disease syndrome CARDS), are 
preserved by freezing, and the DMA is extracted as in Example 2. Two rounds of PGR amplification are 
conducted using Type 1 oligonucleotides FRFDA and Ts/NCB in the first round, then using nested Type 
1 or Type 2 oligonucleotides in the second round. 
5 Optionally, the presence of an RFHV/KSKV family Glycoprotein B polynucleotide is confirmed by 

probing the amplification product with a suitable probe. The amplified polynucleotide is electrophoresed 
in agarose and blotted onto a nylon membrane. The blot is hybridized with a probe comprising the 
polynucleotide fragment obtained from the KSHV polynucleotide encoding Glycoprotein B (residues 36- 
354 of SEQ. ID NO:3), labeled with ^^P. The hybridization reaction is done under conditions that will 

1 0 permit a stable complex forming between the probe and Glycoprotein B from a herpes virus, but not 
between the prot>e and Glycoprotein B encoding polynucleotides from sources outside the RFHV/KSHV 
subfamily. Hybridization condiUons will require approximately 70% identity between hybridizing 
segments of the probe and the target for a stable complex to form. These conditions are calculated 
using the formula given eartier. depending on the length and sequence of the probe and the 

15 con-esponding sequence of the target. The conditions are estimated to be: a) allowing the probe to 
hybridize with the target in 6 x SSC (0.15 M NaCl. 15 mM sodium citrate buffer) at room temperature in 
the absence of formamide; and b) washing newly fonned duplexes for a brief period (5-10 min) in 2 x 
SSC at room temperature. 

Amplified polynucleotides that hybridize to the labeled probe under these conditions are selected 

20 for further characterization. Alternatively, PGR amplification products having about the same size as 
that predicted from the KSHV are suspected of having a related sequence. Samples may also be 
suspected of having a related sequence if they have been used to obtain polynucleotides 
encompassing other regions of a herpes virus genome, such as DNA polymerase. Samples containing 
fragments potentially different from RFHV or KSHV, either due to a size difference or different origin. 

25 are sequenced across the fragment as in Example 4. Those writh novel sequences are used to 
determine the entire Glycoprotein B gene sequence by a method similar to that in Example 7 or 8. 

A Glycoprotein B encoding sequence from a third member of the RFHV/KSHV herpes virus 
subfamily was obtained as follows. 

DNA was extracted from two frozen tissue samples from a Macaca mulatta monkey with 

30 retroperitoneal fibromatosis. Extraction was conducted according to Example 1 The extracted DNA 
was precipitated with ethanol in the presence of 40 Mg glycogen as carrier, washed in 70% ethanoi, and 
resuspended in 10 mM Tris buffer. pH 8.0. The extracted DNA was used to obtain a 151 base pair 
fragment of a heroes virus DNA polymerase gene, which was non-identical to that of KSHV, RFHV. or 
any other previously characterized DNA polymerase. This lead to the suspicion that the sample 

35 contained genomic DNA from a different herpes viojs. that could be used to identify and characterize a 
new Glycoprotein B gene. 

A 386 base pair fragment of a Glycoprotein B encoding sequence was amplified from the sample 
using a hemi-nested PGR. The procedure was similar to that used in Examples 4 and 5. wrrth a first 
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round of amplification using FRFDA and TVNCB. followed by a second round of amplificatjon using 
NIVPA and TVNCB. The final PGR product was sequenced as before. 

Figure 23 lists the potynudeotide sequence (SEQ. ID NO:96) along with the corresponding 
amino acid translation (SEQ. ID NO:97). Underlined is the 319 base pair sequence in between the two 
5 primer hybridization sites. The sequences are different from those of KSHV and RFHV. The 
Glycoprotein B is from a new member of the RFHV/KSHV subfamily of herpes viruses, designated 
RFHV2. 
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SEQUENCES 



1 

' >° 


D*sigrution 


DMcription 


Type 


Source 




1 


RFHV 


Glycoprotein B PCR segment 


dsDNA 


Figure 1 | 


2 


RFHV 


Glycoprotein B PCR segment 


Protein 


Figure 1 | 


3 


KSHV 


Glycoprotein B PCR segment 


dsONA 


Figure 1 |j 


4 


KSHV 


Glycoprotein B PCR segment 


Protein 


Figure 1 | 


5 


sHV1 


Glycoprotein B sequence 


dsONA 


GenBank | 
HSVSPOLGBP 1 


6 


bHV4 


Glycoprotein B sequence 


dsDNA 


GenBank I 
BhfT4GLYB J 


" 7 


eHV2 


Glycoprotein B sequence 


dsONA 


GenBank | 
EHVU20824 j 


8 


mHV58 


Glycc^rotem B sequence 


dsDISIA 


GenBank 
MVU08990 


Q 




Glycoprotein B sequence 


dsDNA 


GenBank 
EBV 


10 


hCMV 


Glycoprotein B sequence 


dsONtA 


GenBank 
HcHCMVGd 


11 


hHV6 


Glycoprxitein B sequence 


dsONA 


GenBank 
HH6GBXA 


12 


hVZV 


Glycoprotein B sequence 


dsDNA 


GenBank 
HEV2VXX 




13 


HSV1 


Glycoprotein B sequence 


dsDNA 


GenBank HS1GLYB 


14 


sHV1 


Glycoprotein B sequence 


Protein 


Translation 


15 


bHV4 


Glycoprotein B sequence 


Protein 


Translation 


1 ^6 


eHV2 


Glycoprotein B sequence 


Protein 


Translation 


17 


mHV68 


Glycoprotein B sequence 


Protein 


Translation 


18 


hEBV 


Glycoprotein B sequence 


Protein 


Translation 


IS 


hCMV 


Glycoprotein B sequence 


Protein 


Translation 


20 


hHV6 


Glycoprotein B sequence 


Protein 


Translation 


21 


hVZV 


Glycoprotein B sequence 


Protein 


Translation 


22 


HSV1 


Glycoprotein B sequence 


Protein 


Translation 


23 


5HVSA8 


Glycoprotein B sequence 


Protein 


Translation 


24-40 




TYPE 1 oligonudeotides 
(Gamma herpes Glycoprotein B) 


ssDNA 
(lUPAC) 


Table 4 


41-47 




TYPE 2 oligonucleotide 
(RFHV/KSHV subfamify Glycoprotein B) 


ssDNA 
(lUPAC) 


Tables 
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1 SEQ. Dttsignation 
1 ID 


Description 


Type 


Source 


J 






48-55 




TYPE 3 oitgonudeotides - 
RFHV specific Glycopfvtein B 


ssDNA 


Table 7 




56-63 




TYPE 3 oligonudeotides - 
KSHV spedfic Glycoprotein B 


ssDNA 


Table 7 




64.66 




CLASS t antigen peptides 
(Gamma herpes Ctycoprtaein B) 


Protein 


Tables 


67-72 




CLASS It antigen peptides 
(RFHV/KSHV subfamity Gtycoprvtein B) 


Protein 


Tabtefi 




73-74 




CLASS III antigen peptides- 
RFHV specific Glycoprotein B 


Protein 


Table 6 


75-76 




CLASS 111 antigen peptide s- 
KSHV spedfic Glycoprotein B 


Protein 


Tables 




77-78 




TYPE 1 oligonudeotide 
(Gamma herpes Capsid maturation) 


ssDNA 
(lUPAC) 


Table 9 


79 




TYPE 1 oligonudeotide 
(Gamma herpes DNA potymerBSe) 


ssDNA 
(lUPAC) 


Table 9 


80^7 




TYPE 3 oligonudeotides - 
KSHV spedfic Gfycoprotein B 




Table 11 


88-90 




TYPE 3 oligonudeotides - 
KSHV specific DNA Pofymerase 




Table 1 1 


91 . 


KSHV 


DNA sequence comprising encoding regions 
for Caps/dMeft/fBtfon fragment, GiycoprxAein 
B. and 0AM polymerase fragment 


dsDNA 


Figure 18 


92 


KSHV 


DNA sequence comprising encoding regions 
for CapsK^ Maft/rston fragment and 
Gtycoprz^in B (residues 1-3056) 


dsDNA 


Example 7 


93 


KSHV 


Capsid Maturation sequence 


Protein 


Figure 18 


94 


KSHV 


Glycoprotein B sequence 


Protein 


Figure 18 


96 


KSHV 


DNA poiymerase sequence 


Protein 


Figure 18 


96 


RFHV2 


Glycoprotein B PGR segment 


dsDNA 


Figure 22 


97 


RFHV2 


GVycppfO/eirj B PCR segment 


Protein 


Figure 22 


98 




Shared sequence 


dsDNA 


Example 7 


99-100 


1 CLASS 1 antigen peptides oiGlycoprotein B 


Protein 


Tables | 





^7 
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Partial Sequence Listing 



5 (1) GEf€RAL INFORMATION: 

(i) APPLIDWT: Rose. Timothy M. 

Bosch. Hamix 
Strand. Kurt 

TITLE OF INVEIfTION: GLYCOPROTEIN B OF T>€ RFHV/KSHV 
10 SUBFAMILY OF JCRPES VIRUSES 

(iii) NlfBER OF SEQUENCES: 40 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Morrison & Foerster 

(B) STTIEET: 755 Page Mill Road 
15 (C) CITY: Palo Alto 

(D) STATE: OA 

(E) COUKTRY: USA 

(F) ZIP: 943M1018 
(vlii) ATTDRNEY/AGEWT INFORMATION: 

20 (A) MA«: Schlff. J, Michael 

(B) REGISTRATION NUMBER: 40.253 

(C) REFERENCE/DOCKET Nl*©ER: 29938-20002.00 
(ix) TELECCrttJNICATION INFORMATION: 

(A) TELEPHONE: (415) 813-5600 
25 (B) TELEFAX; (415) 494-0792 

(C) TELEX: 706141 

SEQ ID N0:1: 

GTGTACAAGA AGAACATCGT QCCCTACATT TTCAAQGTAC QCAGGTACAT AAAAATAGCA 60 

30 ACATCTCTCA CQGTCTACCG CQGTATBACA GAAGCAGCAA TCACAAACAA ATATSteATC 120 

CCCAQQCCCG TQCCTCTCTA CGAMTCAGT CACATQGACA QCACCTACCA GTGnTTAGT 180 

TCCATSAAAA TTGTAGTCAA CQSAGTCGAA AATACGTTCA CCGATCQQGA TGACGTAAAC 240 

AAAACC6TAT TTCTCCAGCC CGTCGAAQGT aAACTGACA ACATACAAAG ATACTTTAGC 300 

CAACCAGTAC TGTACTCTGA ACCCGWTGG TTCCCAQGTA TaACAGGGT TCQGACAACA 360 

35 GTAAAOUTG Aa^TTGTAGA CATCTT 386 

SEQ ID N0:2: 

Val Tyr Lys Lys Asn He Val Pro Tyr He Phe Lys Val Arg Arg Tyr 
He Lys He Ala Ttir Ser Val T>ir Val Tyr Arg Gly Met Thr Glu Ala 

40 Ala He Thr Asn Lys Tyr Glu He Pro Arg Pro Val Pro Leu Tyr Glu 

He Ser His Met Asp Ser Thr Tyr Gin Cys Phe Ser Ser Met Lys He 
Val Val Asn Gly Val Glu Asn Thr Phe Thr Asp Arg Asp Asp Val Asn 
Lys Thr Val Phe Leu Gin Pro Val Glu Gly Leu Thr Asp Asn lie Gin 
Arg Tyr Phe Ser Gin Pro Val Leu Tyr Ser Glu Pro Gly Trp Phe Pro 

45 Gly He Tyr Arg Val Gly Thr Thr Val Asn Cys Glu He Val Asp Met 

SEQ ID N0:3: 

GTGTACAAO AfiAACATCGT GCCGTATATT TTTAAGGTCC GGCGCTATAG (WAAATTCCC 60 
ACCTCTGTCA CGGTCTACAG QOGCTTGACA GAGTCCGCCA TCACCAACAA GTATGAACTC 120 
50 CCGAGACCCG TGCCACTCTA TGteATAAQC CACATGGACA QCACCTATCA UlULI 1 lAGT 180 
TCCATGAAQG TAAATTJTCAA CGOGGTAGAA AACACATTTA CTGAUGfiGA CtyVTGTTAAC 240 
ACCACAGTAT TCCTCCAACC >WJTAMQGQG CTTACGGATA ACATTCAAAG GTAOTTAGC 300 
CAXCGGTCA TCTACGCQGA ACCCQGCTTjG TTTCCCGGCA TATACAGAGT TAQGACAACA 360 
GTCAACTG7G AGATTGTAG* CATGTT 386 

55 

SEQ ID N0:4: 

Val Tyr Lys Lys Asn He Val Pro Tyr He Phe Lys Val Arg Arg Tyr 
Arg Lys He Ala Thr Ser Val Thr Val Tyr Arg Gly Leu Thr Glu Ser 
Ala He Thr Asn Lys Tyr Glu Leu Pro Arg Pro Val Pro Leo Tyr Glu 
60 He Ser His Met Asp Ser Thr Tyr Gin Cys Phe Ser Ser Met Lys Val 

Asn Val Asn Gly Val Glu Asn Thr Phe Thr Asp Arg Asp Asp Val Asn 
Thr Thr Val Phe Leu Gin Pro Val Glu Gly Leu Thr Asp Asn He Gin 
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Arg Tyr Phe Ser Gin Pro Val He 7yr Ala Glu Pro Gly Trp Phe Pro 
Gly He Tyr Arg Val Arg Thr Thr Val Asn Cys Glu lie Val Asp Met 

SEQ ID N0:5: 

5 ATQGTACaA ATAAACACTT AOQCTTATA AMllUIUil TTTCTAOGC ATCTOSACAA 60 

ACGACACaA CTACAflCTGT TSAftAAAAAT AAAACTCAAG CTATATACCA AGAGTATTTC 120 

AAATATCGTIS TATSTACTQC ATCAAaACT QGAGAATTGT TTAGATTTGA TTTAGACAGA 18D 

ACTTGTCCAA GTACTCAAIi* CAAAGTTCAT AfiOyi^CCCk IILIIIIAST GTACAAAAAA 240 

AATATAGTTC CATATATOT TAAAGTCAGA AGATACAAAA AAATCACAAC A TCAGTCC GT 300 

10 ATTTTTAATG GCTQGAaAG AGMQGTGTT GCTATTACAA ACAAATQQSA ACTTTCTAGA 36D 

QOGTrCCAA AATATGAGAT AWTATTATC GATAAGACTT ACCAATGTCA TAATTOCATG 420 

CAfiATAGAAG TAAACQGAAT GTTAAATTCT TACTATCACA GAGATQGAAA TAACAAAACT 480 

GT>GACTTAA AQCCTGTAGA TOGTUTAACG QGTQCAATTA CAAGATACAT TAGCCAACa 540 

AAAGrmTG CTGATCCTQG CTQGCTATQG QGAACTTAa QGACTCGAAC TACCGHAAC 600 

15 TGTGAAATTG TAS^CATGTT TGCTAGGTg GCTGACCCTT ACACATACTT TGTGACTGro 660 

CTTQGCGACA CAGTASAAGT GILILCH It TGTGATtiTAG ATAATTC ATG CCCAAATQCA 720 

ACTBACGTGT TETCAGTACA AATAGAOTA AATCWJOG TTGTTGAaA TGGAAATAW 780 

QCTACATCAC AfiCAO^TAA AAAAAGAATA TT7GCTCATA CTTTAG ArTA II LIGIII LI 840 

7QQGAAGCTG TAAACAAATC CGCGTCAGTA TQCTCAA7QG 1 1 1 1 1 lUMA GAGTTTTCAA 900 

20 CGAGCTATCC AAACTGAACA TGACTTAACT TATCATTTCA TTCCTAA-nSA AATAACAG CA 960 

QGATTCTCTA CAGTGWAGA ACCCTTAGCA AATTTTACAA GTGATTACAA TTGTCTTATG 1020 

ACTCATATa ACAaACTTT AfiAQGATAAG ATAQCAAGAG TCAACAATAC TCACACTCCA 1080 

AATGGTACAG CAfiAATAHA TCAAACAOAA QGTOGAATSA TTTTAGTGTG GCAQXAHA 1140 

AT/CCAATAG AATTAGAAGA AGCAATCnG GAAQCAACTA CA7CTCCAGT AACTCaAGT 1200 

25 GCACCAAaA GCTCATCTAG AAGTAAGCGA GCAATAAGAA QCATAAGAGA TGTCAGTGCA 1260 

GGTTCAGAW ATAATCTGTT TaATCACAA ATACAATATG CATA7GATAA GCTACGTCAA 1320 

«TATCAACA ACGTCCTAGA fiGASHKCT ATAACATQGT GTAGAGAACA AGTGftGACAA 1380 

ACAATQGTGT GGTATCAGAT AGCAAAAAH AATCCAACAA GTOTTATBAC AGCAATATAT 1440 

BGAAAACaG TCTCTCGTAA AGCnTAaGA GATGTAATa CTGTTACAEiA ATGTATAAAT 1500 

30 GTTCACCAAT CTAGTGTCAG CATACACAAG AGTCTTAAAA CASMAATAA TGAaTATGC 1560 

TATTCACOa aCCAGTTAC ATTTAAATTT GTTAACAGTA GTCAGCTGTT TAAAQGACAG 1620 

TTAQaQQCTA GAAATGAAAT I LHLlblLA GAAAGTUTTC TAGAAAATTC CCACCAAAAT 1680 

GCAGAGACTT TTTTTACAQC TAAAAATGAA ACTTACCAa TTAWATTA TGTCCATGTA 1740 

GAAACTTTGC CAGTGAMAA CATTTCAACT TTWyOCTT TTTTAQCTa TAACaAACT 1800 

35 TTCATAaWA ATATTGACTT TAAASCTCTT GAATTGTAn CAAGTQGAGA GAGWAGTTA 1860 

GCAAACGTGT TTGATTTAGA GACTATGTTT >CAGAATATA ACTATrAOSC TCAGAGTATA 1920 

TCTQGCTTAA GAAAAWTH TGATAACia CAAAGAAACA ACAGAfiftCAG AA7 CATTCA A 1980 

GATTTTTCAG AAATiaAGC AfiACTTAQGC TCTATOSQCA AAGTTATTGr TAATETGQCA 2040 

AQCGQCGCAT II IL I Ll l l l TQGAQGTATT GTAACAGQCA TATTAAATn TATTAAAAAT 2100 

40 CCrrTAQGTG GCATGTTCAC ATTTC7ATTA ATWaSAZAG TTATAATnT AGTAATTCTA 2160 

CTAGTACGGC GCACAAATAA TAIGItlLAA GCTCCAATTA GAATBATTTA CCCAGATGTT 2220 

GAfiWATCTA AATCTACTGT GAOSCaATG GAGCCTGAAA CAATTAAACA AATTTTGCTT 2280 

QGAATGCATA ACATGCAGCA AGWCCATAT AAGAAAAAAG WCAACAAAG AGCTGCTAGA 2340 

CCGTCTATTT TTAGACAAGC TXTGAGACA TTTTTQCGTA AGCGATOQG TTACAAACAG 2400 

45 ATTTCAACCG AAGACAAAAT AGTAT 2425 

SEQ ID N0:6: 

ATCTATTATA AGACTATCTT ATTCTTCGa CTAATTAASG TATG CAGTTT CAACCAGACC 60 

AaACACACT CAACCACAAC CTOCCAAGT ATTTtATCAA CCACCTCTTC CACAACAACA 120 

50 TCAACAAGCA AGCCATCAAA CACAACCTCA ACAAATACTT CATTAQCTQC CTCTCCCCAG 180 

AACACGTCAA CAAGCAAfiCC ATCCACTSAT AATCAGQGTA CCACTACCCC CACTATTCCA 240 

ACTCTTACTG ATGACACAGC CAGTAAAAAT TTTTATAAAT ACAG«n"ATG CAGTCCATCA 300 

T C TT CC T C T G GAGAACTATT CAGATTTGAC CTTBATCAGA CAT GTCCA GA TACA AAAGAT 360 

AAAAAACATG TQGAAQGCAT CCTQCTOGTA CTAAAAAAGA ATATTGTCCC ATACATCTTC 420 

55 AAWSTMOGA AATATAGAAA AATTQCCACC TDCTGACAG TTTACAGAQG GTQGTTXCAG 480 

QCAQCTGTTA CCAATAQQGA TGATATCAfiC AG«2XATAC CCTATAATGA AA TTTCA ATG 540 

ATAGATAGGA CCTATCATTG lll L l L T G a ATPGCAA CAG TCAnAATQG GATTCTGAAC 600 

ACCTATATAG ACAQQGATTC TGAAAATAAG TCTGnCCCC TCCAGCCAGT QGCCQGAOG 660 

ACTGAGAACA TAAACAGATA CTTTAGTCAA CC TCTCA TAT A7GCAG AACC TOOCTQGTTT 720 

60 CCAGQGATTT AT^yiAGTGAG AACAACTGrT AATTCTGftQG TTGTTGADff GTA7QCCCGC 780 

TCrcraaAAC CATATACTCA CTTTATTACA GCTCTQQQQG ACACTATTGA AATUnrCCA 840 

TTCTBTCACA ACAATTCTCA ATGCACCACT QSTAATTCCA CCTCAAQQGA 7GCCACAAAG 900 

GTATQGATAG AAGAAAATCA CCAAACTGTT GAaATGAAA GACQOaOGCA TCCCACTAAA 960 
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GATAMA&M I L I IIC IAAA MiATBAGGAA 
GASMMaCTA lllbiWTTT TGTCATATQG 
CATAA7GAGA QCTTTCAaT TCTQGCAAAT 
CAAGAMMAA CSGfiGCTKQ T(£AAATACC 
5 MTGAAACTC JfiGfiAGfiGfiC AGTCAAAAAA 
GTAAM^TACT ATAAAACAAA TtSSGOGACTA 
A All l blU G AACACACAAA aACACTATT 
AGACAAAAAA QGTrUTGTACy^ TACAAAGACC 
CAQCTTCAAT A7QCCTATGA CCATTTAAGA 

10 ACCAAAACAT SHUCCGOQA ACAAAAAAAG 
ATTAACCCAG TGAGTCTCAT QQCAfiCaTT 
QMGATDCAT TCATOGrTTTC TGAGTQCATC 
AAAAGTATGA GAACG&ATGA TCCCAAOGTA 
TTTCTCAATA GTAOTjCCAC CTTCAGGQGT 

15 ACAAACACAC ACGTCGAAAC TT(iTM>l£CA 
ATG^CACACT AT TTTAAG Gft aATAftATTT 
ACCCTO5ATA CATTTTTAAC TUTCAATTTA 
GTQQVACTrr ACAGTGW3AC TGAAAGAAAG 
TTTAGHfiAGT ATAATTACTA CACACAGAAG 

20 ACCATTGACC TGAACAQQtJA CASAaAffTT 
GQAGACATTG GAAAAGTGGTr GGTCAACACA 
ATAGnOGTC GATT7GTCAG l illMUVCA 
CTTCTCATAG IlblUillll lUI lUI 1 1 1 1 
GAQGCCCCCA TAAAAATGAT CTATCCAAW: 

25 CAGCCCCTAC CCQGAOAQGA GATTAWCGC 
AGfTGAGCACG GCAAATCTCA GGAMaAQOa 
QQGGATGQCC TACAATTGCT GCGCAfiXGC 
AGTCCAQGCA ATGACACATC TCAGACACAC 

30 SEQ ID N0:7: 

A TQGaGST CG QGQXGQGCC TCGOaTCGTC 
TGCCAQSQQS TQXGCAAGA AUIIUIUGCT 
CCAGAAGTGG TGGCCGAGGA GAACCCGQCC 
QCCTtGCaA CGQGCGGMliA GATATTCAQG 

35 CMWMCG ACCACATAGA QQQCATAOa 
GTTTTTAATC TCAGAAAGTA TAQSAAGATC 
AGCGAQCaATS CCATAACAAA CCAGCACACG 
CAGATaATSG ACCACTATTA TCAETQCTTT 
GTLAACACCT ACTATGACAG GGACGQGTGG 

40 QGTCTCACCT CTAQCATAAC QCQCTATCAG 
AACCTGTTGT GGTCTTACAC AACAAGAACC 
GCGWiATCCA TGAAACCATT TGAiillLlll 

TCGCccrnr taaaagaaaa tgqcacagag 

ATTCAACTGC TCAAAAACTA TGCTGTCACA 

45 AACGCTACCA G A I IL II IU C AATATTTQGG 
GAAAACAGCT CaACTGTCA TTTAATTTTA 
CAACACAATA QCAGTCTCCA TTTTATTGCC 
TTAaAAaAAG AQGCTAATTT TAACGAGACA 
GAAATTCAAA AAAAGHAAA AGAGCTTGAA 

50 (HCTATAAAA CAAD^QCAA TCTGTACATT 
a/CMfCrC ATGCCAAGCT G7ACAATCTC 
CCCALAACAT CTCCCM£AG AMytCGCAGG 
AATSQAGACA ACTCAACTAA CGf^AO^fiGl 
GACAATCTCA GAAAfiAGCAT CAACAQQGTC 

55 GAADWJTACA QQGCCTOjCT CATGTlffiTAC 
ATGWCGCCA TCTATQGCAS GCCWalblLI 
TCAaATTCTA TCAGTGTTGA CCAAAAGAGC 
QQCAAAGAAG AC L i mbl l A CACCAGGCa 
GAACTCTTTG CTQQCCAQCT QtSGTCCCAQG 

60 GMaSTUTGCC AGCACM3CTG CGAGCACTAC 
AAQGAaACT AaATEHCAG TACCCTCAAC 
ATTACCCTGA AC CmiL T C T GGTA GAGAAT 
AAAACJCAGA AAAQGCTOTC CAACGTCTTT 



TATACCATa CCTGGAAAGC A(aAAGATA&A 1020 

AAAACCTTTC CCAQQQCCAT ACAAACAATC 1080 

GAAGTCACAG CCAGCTTTTT AACATCCAAC 1140 

GAGATATTGA ATTGCATGW TAGTACCATA 1200 

TTTAACAAAT CCCATATCAG A&ATGQQGAG 1260 

TTCOTATa QQCAQXAAT GAAACCCCTT 1320 

GAAAjSAATA ACAAGAC7QG AAATAAATCA 1380 

TTCCAAiarG CCAAQQGOrr GTCCACreCC 1440 

M:AAGCATGA ATCACATOH' MaAOGAAITA 1500 

GACAATCTAA TGTGGTATGA GCTGAGTAAA 1560 

TAT13GGAAAC CTGTGQCAGT GAAAOCCATG 1620 

AATGTnjACC MaSCAAGTGT CAATATCCAT 1680 

TGTTACTCCA GACCCCTOGT CACATTTAAA 1740 

CAGCTTTaSAA CAAQGAATGA AATUTTGCTC 1800 

ACAGCAGATC ATTATTTTTT TCTAAAGAAC 1860 

GnSAAGACAA 7GGATACCAA TAACATATCC 1920 

ACTTTTATAG ACAATATASA TTTDUGACA 1980 

AIGGCCAGTG CCCTCGACCT QMM:GATG 2040 

OTGCAACTC TlaAGAGAAGA TCTAGACAAC 2100 

AAAGATOa CTGAAATCAT QQCAGACCTT 2160 

TTT>GTGGCA T7IITCACTST IIIIUdblLI 2220 

AACCCCATTQ QQQQCGTGAC GATCATCaC 2280 

ATAGTOCCA GC;AGAACCAA TAACATGAAC 2340 

ATTGACAAAS CCTCTGAGCA GGAGAACATT 2400 

ATCCTCCTTC GAATTjCACCA QCTCCAGCAA 2460 

AQCCATAAAC CAQGGTTGn CCAAaATTG 2520 

QQGTATACTA QGTTACCAAC TTTniACCCC 2580 

CAAAAATATG TTT 2623 



C IC m i LTA T anOCGTCGC TQCGCTTCTC 60 

GAAACGACCA CCCCGTKSC AACCCACM>A 120 

AAOXCTTTC TQCCGTTCAG QGTATQCGGG 180 

TTOXCCTGG AGGMaAGCTG CCCCAACACG 240 

OCATaACA AGACCAACAT AGTGCCTTAT 300 

ATGACCTCGA CCACCATCTA CAAGQtTmGG 360 

AfiGAGaACG CCGTCCCCCT GTACGAQGTrC 420 

AGCGCCGTAC AGGTCAACGA GQQQGGGCAC 480 

AACGAGACCG CCTTCCTCAA ACCQQCCGAT S40 

AGTCAACCAG AGGDGTACGC CACCCCCAGA 600 

ACAGTCAACT GCGAQGTGAC AGAGATGTa 660 

GTCACGTCTG TTQGTGACAC TATA&GATC 720 

CCAOMjAAAA TCTTGAAAAG ACCACAaa 780 

AAGTACQGTG lUiibl lU bb GCAOQCTGAT 840 

GACTATTCCC TCTCTTQGAA AGCCACCAa 900 

TQGAAGQQGT TTTCCAATQC CATTCAAACT 960 

AAT&kTATAA CAQCCTCCTT CTCTACTCCT 1020 

TTTAAGTGTA TATGGAACAA CACCCAAGAA 1080 

AAAACTCACA GACCTAAOSG TAOQCGAAG 1140 

GrrraocAAC cgcttataca gatagacctc 1200 

ACAAAOJCTA CAGCTTCACC TACATCAACA 1260 

GATAOTCAA (JTCTTACTCG CGGTQGAAAT 1320 

gtqgcqqcct cccassttca GrrrocaAT i38o 

TTQQGAGAX TGTCCAQQGC ATOHGCMaG 1440 

GMOGAGCA MMTCAACCC CACXJCCSTC 1500 

GCCAAGTTGA TAGGOMCGT QGTUTCAGTG 1560 

G l bUililU C ACAAAAATAT GAAQGTtiCa 1620 

GTGGTGGXT TCA«TTTAT CAATOOGAGC 1680 

AACGAGATTC 7QCTGTCCAC CTCTCAGGTC 1740 

TTCCAQQCCG QGAACCAGAT GTACAAGTAC 1800 

CTGACTGACA TACCCACCCT ACACACCATG 1860 

ATAGATTTTA AQGTGATTGA GCTCTATTCT 1920 

GACAK:GAGA CCATGTTCAG QGWH-ACAAC 1980 



^0 
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IfiaKfOC AGAACCTCAA CQQQCTQW3G 
MSjMCAGCT TCATCCAGAC CCTG GSrSAC 
GTOSTOGTCA ATGTOGCCAC CGSm Ullt 
ATAAGCTTTT TCAAAAATCC CTTTQQQGQC 
5 GTAG7BGTOG TGTACCTGTT TATGACC^ 
A^irraACC CCGGGGTS S ft GAQQQCGQCC 
GACCAAATCA GGAACATCa GATQQMATS 
GAGGAGXCC GACGAGAG(aA A(yUGTAAAA 

GAnrrxav ccAGCGrrcr gaggaq(>mm 

10 CGMaACGQGA GCGACGATGA GQS3GATTAT 
GACGACGTGG ACGTQ&AGGC AQGCACGGCG 

SEQ ID N0:8: 

ATSTACCCTA CfGTGMMG IkJGAGfiGK 

15 I lilU i aibl aCACACQCA TCTCTACGTA 
GACATGACCC CAGCCCAGE^ CGCTtXAACA 
AAD«y«3WT rnyCTACTT TCQCGTUTUT 
TTTGATTTAG ACAAAACATG ODOUCTfiLA 
CTCGTOTATA AGATCAACAT CGTCCCCTAC 

20 ATTACTCAAC TGACCATCTG GCGWGCaA 
ATQGCCACTC AGGCCCACGA GIOSMAGTC 
AATAGCGCCA CCATSCtTOGTr AA ACAACG TC 
AATAAAACTG TGAACATACG CCCTCTTGAT 
AGTCAQCXCA (XCTTTATTC AfiWCCTQCT 

25 ACCGTTAACT GTGAAATTGT AGACATGGTG 
GCTACCGCCC TQGGAGACAG CCTQGAGCTC 
CAGTCTACTG CGCaAAGAG AfiCTWTATS 
GTAGATTATA ATAACAGQQG AACTQCCCCC 
CCCTCTQCa CTTACTCCTB GWWVMQCC 

30 CACTTStSAWA CATTCCCTCG CQCaTCCAA 
GCCAATaWG TCACCQCCAC CTTCAATAW 
ACGTATWCT QCGTC«TBA CXWATCAAT 
AACAACTCCT ACGTQGCCAG TOSGWAACA 
CTCAinmC AACCACTCGA ACATCCAGAG 

35 GAACCAACCC CCGCCCCACC AAAGTCCACA 
AAaCAACAT C7GAQGTCTC AAAQQGCTCA 

rrraxTATC acaagctwc caccagcgtc 

TQGTCTAGAG AACAGCnT:AG AGACACCCTC 
ACGAGTCTGA TSTCTT3CCAT TTATCtiAAAG 

40 ATATrrGTCA CAGACTCTAT aATGTQGAC 
CGGCTGCAGC ATGATAAAAC CACCTQCTAC 
AACAGTACAG ACCCGCTAAC TGQCCMITTG 
ACAAAaTAG AAACATQCAA GGATGAGAGT 
TAaATTATA AAAAHACAT TTTTCAAGAA 

46 GACACATTTA JAOlCCKM TATCTCATTT 
CTGTACTCCT CTACTGAAAG GAAACTCQCA 
AQQGAATATA ACTATTACAC aACAfiCCTC 
ATCGACTACA ATAGM^ACAG MTTQGTTCAG 
GACATTQGAA GATOGTOGT GAATUTQCITC 

50 GTG«:AQQGT TCATTAAAn CTTTACCAAC 
ATTQGTGGAA TAAILIILIl GGTGGTAOTC 
GCACCCATCA AAATGC7CTA CCCTTCTGTr 
CCCTATAGCG CATCACCTCC AfiCTAT«AC 
ATGCATCAGG TACACCAQU AGAAAAOGAA 

55 A LI ll b l U SC AGAAAGCOC AGGATTCCTT 
CCmTGGAAG ATGAATCWt TTCCOa 

SEQ ID N0:9: 

ATEAC7U3QC GTAQQGTCa AAXGTOETTC 
60 GGTmCQCAGA CCCCAGAGCA QCCCtSCKCr. 
CCTCAQCAAA CCAQCTTTCC TTTCCGAGTC 
CQCTTCTCCT COiACATCCA GTGTCCCTCG 
CTGnTGATCG TCTTTAAAGA CAACATTATT 



AAGGACCTGG ATSACAGCAT AGATEATQGC 2040 

ATCATQCAQG AaTTQaOCAC CATAGGCAM3 2100 

TC aiLl i m QGAGCATAGT CTCQQQQGTG 2160 

ATGCTECTCA TAGTCCTCAT CATAGCCQGG 2220 

TCCy«3GAGCA TATAOCTOC CCCCA7TAGA 2280 

CAGGAGCOaG GCGCGCACCC GGT?n>GAA 2340 

CACCAATTTC AGCAGCGGCA GCGQQCGGAA 24O0 

GGMAAM^AA L i L I LIUb A AGTGATAAGA 2460 

AGK3QGGGTG GTGQGTACCA GCGCaACAS 2520 

GMSCCATTGA GQCGACAAGA TGGAQGCTAC 2580 

GATACCQGTC TGTAA 2625 



GCCCACCTAA CCAATCTCCT AACCCTTCTG 60 

TGTCAGtXAA CCACTOBAG (SCAGCCATCA 120 

GAGACTCCCC CACCCCTCTC AACTAACACT 180 

GSaGTQQCrS CCACGQQQGA GACCTTCAGG 240 

CAASATAAGA AGCATGTGSA QQGCATCTTG 300 

ATCnCAAAA TCAGSAGATA TAGAAAAATA 360 

ACCAaAGTT CAGTCACTCG TAAATTTGAA 420 

GQCGACTTTG ADUaCATUTA TCAGKSCTAC 480 

AGACAOrrGT ATGTQGACAG AUTOQQGTC S40 

QSTCTAACAG GGAATATCCA AAGATAHTT 600 

TQGATGCCTG GCTTTTATCG TUTTCGAACC 660 

GCACGCTCCA TOGATOXTA TAAaACATC 720 

TCCCCGTTTC AAACCTTTCA CAACACCAQC 780 

AGQGTCAGQS AGGTCAMaAA TTACAMTTT 840 

GCTGGACAAA QCM2yurTT TUT/VMaACT 900 

ACCAGACAAA C7GCCACGTG CGACCTQjTTi 960 

ACTXICATG AACATAOTA CCATTTTCTC 1020 

CXXCTGACTG AGCjTMMAA TTTCACCAQC 1080 

AA6ACCATCT CTGAATATAT CCAAAAGTTC 1140 

CAGTATTTCA AaACTGATOi TAACCTETAC 1200 

ATTGAAGACA TAGACGAQGA CAGCGACCCA 1260 

/^GAGAAAAA GAGAGQCAGC TGACAATGGA 1320 

GAAAATCCQC TCATTACGQC CCAAATTCAA 1380 

AACAACGTGC TTGAGGAGTr CSTCCAflOGCG 1440 

ATGfTGGTATG AGCTTAGCAA GGTDMCCa 1500 

CCTGTCGCTC CCAQCTACGT QQGCGACQCC 1560 

CAAAGTTCAS TCAACATCCA CCAG«3CTTG 1620 

TCGAGACCTA GAGTCACCTT CAAATTTATA 1680 

GGTCCTAGAA AAGAAATTAT CCTCTCCAAC 1740 

GAACACTACT TCATTGTOQG QGAATACATT IBOO 

A>GCTAAACC TCTCAAQCAT CGaAOrTA 1860 

ATTGAAAATA TCGAOUAA AACAGTAGAA 1920 

7TCGAQC6TCT TTGATATAGA ATCCATGTTT 1980 

QCGQGCATTA AGAMjGACa AGACAACACC 2040 

GACCTGTCAG ACA7GATQQC TGATCTQQGA 2100 

AQCTOSCTAB TCACATTTTT CAGTAGTATT 2160 

CCTUTAGQQG GAATATTCAT TOCCTAATT 2220 

aAAATAGAA GAAACTCACA GTTTC^GAT 2280 

GAAAAaACG CTGCCAGU> GGCGCCACCT 2340 

AAMy^SAAA TTAMXGCAT ACTTTTGOGC 2400 

GCACAGAAAC AAaAACCAA OOQGCCCT Z460 

AGAAA7CGCC QGAAQQCSATA CAGCCAACTT 2520 

2548 



GTBCTQCTAG CCGeXCTBGC GTQCCGTCTC 60 

CCCQCCACCA CQGTOCAfiCC TACC GCCACG 120 

TGCGAGCra CCWCCACQG CGACCTCTTC 180 

TTTQQCACGC CSGAGAATCA CAOSGAGQGC 240 

CCaACTCGT nAAOJTCCG CTtaACACC 300 
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AAfiMAffTSA CCAACATTCT CATaACAAT 
CACOGGAGA ASTTCTCCGT TEACAQCTAC 
TGCTACAACG CGGITCAAGAT GACAAAAGAT 
GGAGTTAACA TDCCGTCM CCTAAAQCa 
5 TACGCCAGCC AGACG»GCT aATGACQCC 
AGAAaACCG TCAACTGCa GATAACTGAC 
TTCTTTGTGA CCACCACCQG GCAGWrTOTC 
AAGGAAACa TCCATGAGCG QGCAGACTCC 
GACTACGACA ACCGAQQGAC GAACCCGCM 

10 ACTTACACGC TA7CTTQGAA QCTCGKiAAC 
CAAACCTTTG ACTCGACCAT CQCCACAGAA 
GAQtSGCACa CTAGCTTCGT GACCAACACA 
AAG;FSCATCG AMj^^CAGGT gaacaagacc 
CGTTACAC&A AGGQCCAGGA AQCCATTACA 

15 QCTTQtSCTAC CTUTTiACCCC QCQCTDJTTG 
ACTCCGACTT CCTCACCCCC CAQCAGTCCA 
AQCACCCCCG COSCCGnCT GAQGCGTOSS 
GTGCCCCCCA CQQCCCCCQG GAWTTCCCTC 
ATCCAATTTG caACGACTC CCTGCGCCGC 

20 CQQGCCTGST XCTQGAGCA GAAGAGGCAG 
AATCCAACCA CCGTCATGTC CAQCATaAC 
GATCTCATCT CAGTCTCCCA GTOCGTOCCC 
AGCATCAQGG TCCCTCGCTC CGAGACCATG 
TTTATDVACG ACACCAAGAC aACGAGQQA 

25 ACAAAAAA£A TGACGQAGGT GTGCCASEXG 
GAQATCCACG TCTACAACGA CTfCCACCAC 
AaCTGCAfiA CCTTCATCTC ACTAAACACC 

CTGGMrrur actcacqqga cgaacagcgt 

TTCCQQMGT ACAACTTCCA GGCQCAAAAC 
30 GCAGTGTCAA ACGQAAGAAA TCAATTCGTi; 
QGTAQCGTQS GTCAGTCCAT CACCAACCTA 
UUilL i nU GTTTCATCTC tllLIILAAA 
CTOGTOQCQG QGGTQCTGAT CCTCGTTAn 
CAQCAGCCQG TGCASATGCT CTACCCCQQG 
35 QGTCAQQGTC CAQSCATTAA 7CCCATTAGT 
CTQCAITSAGC AAAACDV£A GCAAAAGMsA 
GCCAGCAGAG CATTGCAGGC AGCCAGQSAC 
CACGATCCAG AWCCGCCGC CQCAC7GCTT 

40 SEQ ID N0;10: 

ATCGAATCCA QSATCTOrrT; CC7HJTAGTC 
GCTG LUm il aiL I I LIA G TACTTCCCAT 
ACTTCTCGTA CGACGTCTIGC TCAAACCCQG 
GAAGCCGTCA GTCATM^ CAACGAGACT 

45 GTOGraoaW TCAACACTAC CAAGTACCCC 
GATCTTATTC GCTTTGAAOa TAATATCATC 
TTGGATSAQG QCATCATQGT GGTUTACAAG 
CQQGTCTACC AAAAGGmT GACGnTCGT 
CTGCTQGGCA GCAATACGGA ATACGTGGCG 

50 AAGTTTQCTC AA7GCTACAG TTtXTACAGC 
TA7CATAQQG ACAGTTATSA AAACAAAACC 
ACCCACAGTA CCCdTTACGT GACGGTl>AG 
CTCTATCGTG AGACCTtn-AA TCTGAACTCT 
TATCCTTATC Al 1 1 1 1 1 lUC AACTTCCACG 

55 AACGGAACCA ATCGCAATQC CMjCTACTTT 
CCGAACTACA CCATCGTTTC CGACTTTQGA 
TTQGTQGCTT TTCTCGAACG TOXGACrCG 
AATGTCACCT GCCAQCTCAC CTTCTQQGAA 
GAAGIVCTCGT ACCACTTTTC TTCTGCCAAA 

60 GAAG1GAACA TGTtCGACTC CGOQCTQGAC 
CW3CA6ATTT TCAATACTTC AT/tAATCAA 
TTCGMACCA UmbLUjILi UilUblb l I C 
GAATTOGAAC b lll U ULLAA TCGATCCAGT 



QGCraGTAOS CQGACTCCET GACCAACOSG 360 

GAAACTGACC ASATIQGATAC CATCTACCAG 420 

GGQCTCACGC GCGTGTATUnr M^UXGCGAC 480 

ACCGGQGGCC TQQCCAACGG GGTGCGCCGC 540 

CC CG OGTOGT TGATATGGAC TTACAGAACA 600 

ATGATOGCCA AGTCCAACAG CCCCTTDGAC 660 

GAAATCTCCC CTnaATGA CQQGAAAAAT 720 

TTCCACGTCA GAAaMCTA CAAGATAGTG 780 

GQCGAACGCC GAGCCTTCa GGACAAGGQC 840 

*BACAGCCT ACTOCOOT TDW^CTQG 900 

KAGOMGr CAATACATTT TGTGAOGAC 960 

A CCG TOS G CA TMaAGCTCCC GGAOjCCTTC 1020 

ATGCATCAGA ASTACGAGQC CG7CCAGSAT 1080 

TATTTTATAA CGAQCGGAQG ATnjTTATTA 1140 

QCCKDSrCA AGAACCTGAC GGMiCTTACC 1200 

TCGCOXCAG CCCCATCCGC GGCC0GCGQ3 1260 

AGQCGQGATG CGQQGAACGC CACCACACCG U20 

QQCACCCTCA ACAATCCCQC CACCGTCCAG 1380 

CAGATCAACC GCATGCTDOG AGACCTTGCG 1440 

AACATQGTQC Vj/GfiCfiACl AACCAAGATT 1500 

GGTAAGGCGG TGQCGGCCAA GCGCCTGQGG 1560 

GTTAACCAGG CCACCGTCAC CCTGCQCAAG 1620 

TOaACTCQC QCCCCCTQGT GTCCTTCAGC 1680 

CAGCTGQGCA CCUCAACGA GATCTTCCTC 1740 

ACCAGCCAGT AaACTTCCA GTCCQGCAAC 1600 

TTTAAAACCA TCGAflnOGA CQQCATIGCC 1860 

TCCnCATCG WiWCATTGA CTTTGCCTO: 1920 

GCOCCAACG TUTTTGACCT GGAGQQCATC 1980 

A TO ZCG GC C TQCQGIWU3GA TTTQGACAAT 2040 

GAOQGCCnS GQGAACTTAT GGACAGTCTG 2100 

GTCAGCACGG I tfjUUUm il GTrTAQCAGC 2160 

AACCCCTTCG GCQQCATGCT CATTCTQCTC 2220 

TCCCTCACGA GGCGCACGCG CCMaATGTCG 2280 

ATCGACGAGC TUSCTCAGa ACATQCCTCT 2340 

AAGACAGAAT TACAAGCCAT UTGTTAGCG 2400 

GCIAGCTCMM GQQCGGCCGG ACCCTDOG 2460 

eun i llC AG GCaACGCAG AAGACOaAT 2520 

GGQGAQGCAG AGACTGAGn TT 2572 



TaCGTTAACC TGTBTATCGT LHjI LIU A^I 60 

GCAA LIILII CTAODUZAA TGGAAQCCAT 120 

TLAGTCTATT CTCAACACGT AAC G I L II Ll 180 

ATaACAACA CTACCCTCAA GTACQGAGAT 240 

TATCQCGTGT GmtTATOGC CCAQGGTACG 300 

TGCACCTCGA TGAAQCCTAT CAAJtM&C 360 

CGCAACATCG TQQCGCACAC mTAAQGTA 420 

COTAGaACG CTTACATCTA CACCACTTAT 480 

CCTCaATGT QQGteATTCA TCACATCAAC 540 

CQCGTTATA6 GAQGCAOiGT ITT LG IU iC A 600 

ATQCAATTAA TTCCOiAaiA TTATTOAAC 660 

GATCAGDQGC ACAGCCGCQG CAGCACCTQG 720 

ATOCTGACCA TCAaACTGC GCGCTCCAAG 780 

QGTTGATGTQG TTTACATTTC TaillLIAC 840 

QGAGAAAACG CC&^CAACTT TTTCATTTTC 900 

AGACCCAACG CTGCGCCAGA AAaCATMjG 960 

GTG AIL I Ln GQGATATAU QGACGAGAAG 1020 

GCCTCGGAAC GTAaATCCG TTtCGAAGCC 1080 

ATGACTOCAA ClIlltlUIL TAAGAAACAA 1140 

TQCGTACGTG ATGMQCTAT AAATAW7TTA 1200 

ACATATGAAA AATACQGAAA CGTGTCCGTC 1260 

TGGCAAGGCA TDWSCAAAA AILIllUbIb 1320 

CTGAATATCA CTCATAQGAC CAGAAGMGT 1380 

9^ 
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ACGAGTGACA ATMTACAAC TCATTTGTCC AQCATT3GWT CQGTOCACAA TCTOGTCTAC 1440 

QCCCAGC7QC AGTTCACCTA TWCACGTTG CQCQGTTACA TCAACCQQGC GCTQQCQCAA 1500 

ATCGCAfiAAG CCTQSTGTST QSATCAACQG CQCACXTAG AGUILI lUA QGAACTCAQC 1560 

AttATCAACC CGTCAQCCAT TCTUTCGQCC ATTTACAACA AACOjATTGC CQCQCGTTTC 1620 

5 ATOQGTCATG TCTTQQGCa QQCCAQCTGC GTWCCATCA ACCAAACCW CG7CAAQGTO 1680 

CTQCGTGATA TSAACGTGAA QGAATCGCa QSACQCTQCT ACTCACGACC CGTOCTCATC 1740 

TTTAATTTCG CCAACAQCTC GTACGTOCAG TAOSGTCAAC TQQQCGWGA CAACGWATC IfiOO 

CTUrraOGCA ACCACCKIAC TSAflSftATCT CAQCTTCCa GCOCAAGAT CTTCATCQCC 1860 

QQSAACTCQG CaACGAGTA CGTOG«TAC CTOTCAAAC GCATCATTSft CCTCAQCAGT 1920 

10 ATCnXACCG TCGACAGCAT GATOSCCCTG GATATONCC CQCTGGAAAA TACCGACTTC 1980 

AQQGTACTQG AACTTTACTC QCAGAAAaAG CTOCGTTCCA QCAACGTTTT TSACOCGAA 2040 

GteATCATQC GCGAATTCAA CTCGTACAAG C«Ca9STAA AGTACCTOGA GGACAAQGTA 2100 

GTCGACCCGC TACOiCCCTA CCTCAAQQGT CWCKGfCC TCA7G«3CQG CCTBGQCGCC 2160 

QCQQGAAAQS CCGTO3CGT AQCCATTGQG GCCGTOQGTB GCQOSSTCQC aCCGTPGTC 2220 

15 OW3GCGTTG CXAOnTCa CAAAAACCCC TTCQGAQCCT TCACCA7CAT CCTCGTQGCC 22B0 

ATAGCCCTAG TCATTATCAC TTATTrGATC TATACTOAC AQCGtSCGTCT GTGCACQCAG 2340 

CCQCTQCAGA ACCTCTTTCC CTATCTQGTO TCCQCCG«a» QGACCACCGT GACGTaSQGC 2400 

AGCACCAAAG ACACGTOITT ACAQQCTCCfi CCTTCCTACG AQGWAETGT HATAATTa 2460 

QGTCQCAAAG GACCt3QGACC ACCGTCGTa GATtSCATCCA CQGCQGCTCC GCOTACACC 2520 

20 AACGAQCAQG CTTACCAGAT QCTTCTOSCC CTBGCCOnC TGGACQCAGA OAGCGflOCG 2580 

CAGCAGAACG GTACAWTTC TTTCGAOjW CAfiACTOOA OSCAGGACAA a3GAC«AAG 2640 

CCTAACCTGC TAMCCQGa GCGACATCGC AMAACQQCT ACAGACACTT GWGACTCC 2700 

GACGAAGAAG AWACGTUTC AA 2722 



25 SEQ ID MO:U: _ 

ATWfiCAASA TSAaMTATT ATTCCTT3GCT UICIIIllbA TBAATASTCT TTTAA TCATA 60 

TAT7T3CGATT CQG4TGATTA TATCAGAQCG GGCTATAATC ACAAATATCC 1 1 1 ILbUA TT 120 

TCTTCGATTC CCAAAGGCAC TSATTTGATB CQGTTCGACA GAfiATATTTC GTCTTCQCCA 180 

TATA«TCTA ATGCAAAGAT GTCQGWQGT TTTTTCATCA TTTAtlAAAAC AAATATCSW 240 

30 ACaACACTT TTCDCTGAG AACATATAAA AACGAQCTGA CGTTCCAAAC CAGTTACCGT 300 

GATUTQQSre TGETTTATTr TCTQQATCQG ACQSTGATOG GIUUULCAT QCOjSTGTAC 360 

GAAGCAAATT TAinTAATTC TCGTGCGCAG TGrTTATTCAG CCGTAQCG^T AAAACGACCC 420 

GATQCTAOjG TCTTTAimSC CTATCA7GAG GATAATAATA AAAACGAAAC laAGAATTA 480 

rrrcaosA atttcaagtc TGrrACTAAT aaawsattta tcact acgaa agaaccctac 540 

35 TTTGCAAGQG GTCCTTTCTG QCraATTCT ACATC GACGT CTCTC AATTC TATTCTGACG 600 

GWtjCTACGG CTAAQGCGAA ATATCCGTTT «TTACTTTG CTTTEiACGAC TQGTCAAATC 660 

GTOGWQQCT CTCCGrrrrr CGACQGTTCA AAOSGTAAAC ATTTTQCAGA GCCGTTAGAA 720 

AAATTWCAA TCTTQGAAAA CTATACTATIG ATAGAASATC TAATSAATQG TATWTPG G 780 

GCTACTAOIT T«TAAQGAA GATCQCTTTT CTCQteAAAG QQSATACTTT GIIIILIIUG 840 

40 GAAATDWGG AAGAGWTW ATCQGTGTCT ATBCTAAAGC ACTQGACTAC QCTBACTCAC 900 

QQQCTTCGAG OaGAGACQGA TGWyCTTAT CAOTTATTT CTAAQGAGTT GACAaCCGQ 960 

TTCGTCGarr CCAAQGAGTC TTTAAATm ACCGATCCCA AACAAACGTC TATTAAGAAT 1020 

GAATTTCAGA AGATAAHAC AGATCTaAT ATCTCAGATT ATAATCATGA CTACAQCATG 1080 

AACQGTAGTT ATCAAATTTT TAAGACTACG QGWiATCTCA mTGATTTG QCAQCCTCTT 1140 

45 GTQCAWAAT CTCTTATOGT TCTTCACCAG QtnTCACTAA ACTTACGT« GAQQCGAGAT 1200 

TTOGTQGATG 7CAAGTCTAG ACATGATA7T CTTTATIITGC AftTTACAGTA CCTCTATGAT 1260 

ACTTTGAAAfi ATTATATCAA CGATQCCTTG QQGAATTTQG CAGWTOTG GTGCCTCGAT 1320 

CAAAAAOSAA CGATAACGAT GnOCACGAA CTTWTAAGA TCA GTCCATC GAGTATOJTC 1380 

TTTCAQGTTT ACQCnCGTa GATATUTQCA DCTTQCATC GTGATGTUTT AQCTATCTCG 1440 

50 AAATQCATAfi AAETTAATCA ATCATtXCn CAQCTTTATA AGWrTATOG GGTOnCGAT 1500 

GCGAAQGGAG TAW3GAGTOA AACGATGTCT TATAATCQQC CCTTGGTCAC GT TTAQCTTT 1560 

GfTGWCTCCA CGCCTCAQGT I b l LLIIU^ T CAGa«QGT TAGATAATGA GAIIll bllb 1620 

QGTOATCATA GQACAG^tGGA ATGTCASATA CCTACTACJm AGATATTTa ATCTSGAAAT 1680 

CATBCACACG TGTATACCGA TTATAOjCAT ACGAATTCW CQCCCAT*GA AGACAT7GAG 1740 

55 GTATTQGATG CTTTTAnAG ACTAA«»TC GACCCTTTCG AAAATOCTW TTTTAAACTA 1800 

CTTCATTTAT ATTCQCCQSA CGAATTGAffT «HfiCAAACG TTTTCGATTT AfiWAATATT 1860 

CTTCGTliWT ATAACTCATA TAAGWOjCA CTATATACTA TAfiWGCTAA AATTGCTACT 1920 

AATACQCCGT CGTATCTCAA -raOGATTAAT ILIIllllAC AAQOOCTrOG GGCTATAQG C 1980 

ACTOafcTTOG aiCGGTTAT AAGTOTTACG QCJCCfiCCAC TTQQQGATAT IblUGGTOW 2040 

60 (albb l bltll TTTTAAAAAA 7CCATTCQQG QSraCTCTCA TCTTCATnT AQCGATAGTA 2100 

GTTCTCCrTA TAATAATTGT Om 1 1 l Ua l I ASACAAAAAC ATGTQCTTAG TAAQCaATT 2160 

GACATMTGT TTOTrATQC CACCAATCCG GTCAaACTC TCTCCAGTCT TAaaaSGACC 2220 

ACTUTCCTCA AGACQCCTAG TCTTAAAGAT QCTCACQBQG GCACATUraT TQCQGTTTCG 2280 



93 
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GAAAAAGAQG AGQGTATQX T&€GTCAGT 
GfiAGfCmi TAA MATGCT CAAGGCCATA 
CLI lUalLI I CTCAGTCTCA TGCCTCAAAA 
GGTTATAAGA GTGTAAATGT AGAAGAAGCG 

5 

S EQ ID W0 :12: 

ATGTTTGTTA CGGCGGrTTGT GTCGGTCTCT 
GAGCCCACAC AATCAMAGA TATAACCCGG 
AGAGAAGaA TACACAAGTTC CCAGGfiCOX 

10 CCGCCAACAG GCTCCACAAT CGTACGATTA 
CIIUblAAAA ACTTTACAGA OaSTATTGCT 
AAGnTAAQG CGACQGTATA TTACAAASAT 
TOTATACGC AMTTAaAA TAGATATCCG 
ACQGACACCA TTCATAAGn TQGCAAGTGT 

15 CACAAAGTTC AAGCCTTTM TCAQGATAAA 
AAATATAATT CTGTGG&ATC CAAAGCATQG 
GGAAOXCCG (yUCATATAG »CGQQCACG 
QCCAGATCAA TATTCCCTTA TCATAGTTTT 
TCCC Cbl l ll TTQQCCTACG GGATQGTOCA 

20 CGTTTTCACC AGTTTGNSQG TTATAGACAA 
CCTCCAGCQC QGAACrmr AGTCACQCCT 
AAACGAAOjG AAGmUTTC GCIIUILAAG 
GAGTATGCAC ACAATTTTCG CTTTACAATG 
ACAAACGAGT TrAATTTTAA CCAAATCCAT 

25 GCTATTATTA ACCQGATaA TACAACCAG^ 
ATCCAGACCT ACCTTGCC« WSOGOGGfTTT 
TCCCTCGCCC GTUTCTATCT CCAAGAATTG 
AAACACCCGA CTCGAAATAC CAGATCCCGA 
MaAACAATAA CAACCACXTC KWaSTOGAA 

30 ATTCA«AGC ATGTrAATCA AAlUIIUbCA 
AATCQCGAAC QCGCCnTTG GKCCtXJM 
ACCATTTTQG ATCAACGTCT TAAAQCTCGT 
TGTCCAGAAC TGC5GATCAGA TACACGaiT 
AGTACTAOiC GTTGTTATAG CCGTCCTTTA 

35 ACGGTQGAGG GCCAGCTTQG AACAGATAAC 
CaTGCGTQG aAATCACAA QCGATATnT 
GATTATCGnr ACGTCCGTGA AATCQCAGTC 
GATTTAAACT TAACACTTCT TAAAGATAGA 
GAOiflQCTGC QGGATACAQG ATTACTAS« 

40 CATTCGCTGC GTTnTATGA CATAGACAAG 
ATQCAQQQCA TGQCTCAGTT TTTCC«3t3GA 
Gmmil ll b QQGCCAOSQG AGCGCTGCTT 
TCTAWXCAT TTQQSSCAn QGCCGTQQGA 
TTTTTTGCGT ACCQGTACGT QCTTAAACTT 

45 CTCACAACCA AQQQSnAAA AOGHACCG 
AACGCTACTG ATACCCCAAT AGWWiAAATT 
AATAGCGQGT TTGfcTCCCGA TAAATTTOiA 
TTMTATCTS CGQCTGAGCG CCAAGAATa 
CTTTTAACTT CACGTCnTAC CQGCLIIbLT 

50 CQCACCGAG/^ ATGTAACGQG GGTT^AAA 



ggm::aaataa gtggtimtga atattuacaa 2340 

AAGTCnTMS fCGfiGKCTA (JGMSMfiA 2400 

CaAGTTTGA TAGACAGUT CAGGTATAGA 2460 

TGA 2493 



CCAWCTCGT TTTATGAGAG TTTACAAGTA 60 

TCTGCTCATC TQGQCGATQG TGATSAAATC 120 

GAAACAAAAC CCADSTTTTA CGTCTQCCCA 180 

GAACCAACK GGACATHTCC GGATTATIIAC 240 

U l I bl I l A TA AAGAAAACAT TQCAQCGTAC 300 

6TTATCGTTA GCACGGCGTG GQCCGGAAGT 360 

GATAGQGTAC CAATTCCCGT T7CAGAGATC 420 

TnraAAAG CAACGTACGT ACGAAATAAC 480 

AATIXACAQG ATATGCOa AATCGCATCA 540 

CATACTACCA ATGACACGTA CATQGTTGCC 600 

TOacnGAAn GCATCATTGA GGAAGTTGAA 660 

Q&CTrrcCA CQQGteATAT AATATACATS 720 

TACAG«yW: ATTCCAATTA TQCAATQGAT 780 

AQQGATUTTG ACACTA»GC ATTACTGGAA 840 

CATTrAACGG TTQGT7GGAA CTGGMGCCA 900 

TSQCGTGMjG TTGAAUCGT AGTTCGCGAT 960 

AAAACACTTT CTACCACCn TATAAGTGAA 1020 

CTCAGTIIAAT GTIGTAAAGGA GGUQCCCGG 1080 

TACAACTCAT CTCATGTTAG AACCQG(3GAT 1140 

blUilUjIti r TTCAACCCCT QCTCAQCAAT 1200 

GTOGTGAAA ACACTAATCA TTCACCACAA 1260 

CGAAGCGTGC CAGTTGAGTT QCGIGCCAAT 1320 

TTTQCTATQC TCCAGTTTAC ATATMCCAC 1380 

CGTATCrca CGraSTGGTG COCCTACAA 1440 

TTTCCAATTA ACCCAAGTGC TTTAXGAGC 1500 

ATTtTCQQCG ACGTTATaC CtilllLIAAT 1560 

ATACTTCAAA ACTUTATGAG GGTA7CTQGT 1620 

ATTTCAATAG TTAGTTTAAA TGOGTCCGGG 1680 

GAGTTAATTA TGTCDGAGA TCTCTTAGAA 1740 

CTATTTGGQC ATCAaACCT ATATTATGAG 1800 

CATGATGTOa GAATGATTAG CACTTACGTA 1860 

GAGTTTA7GC CQCTCCAAGT ATATACAAGA 1920 

TACA3JGAAA TTCAACGCCG AAATCAAATG 1980 

G i m i ia CAAT ATGATAQCGG AACQGCCATT 2040 

CTTGQGACC6 CGGQCCAGGC CGTTGGACAT 2100 

TCCACCGTAC ACQGATTTAC CA Ua ll ll lA 2160 

TTATTQGTTT TGGCQGGAa GGTAQOaQCC 2220 

AAAACAAGCC CGATGAAGQC ATTATATCCA 2280 

GAAGGAATQG ATCCCTTTGC CGAGAAACCC 2340 

G3CGAOCAC AAAACACTGA ACCGTCQGTA 2400 

GAAQCCCAjG AAATGAHAA ATATATGACG 2460 

AAAQCCCGCA AAAAAAATAA GACTAGCQCC 2520 

TTACGAAATC GCCGAQGATA CTCCCGTCTT 2580 

2608 



SEQ ID NO:U: 

ATGCaCCAQG GCGCCGCGCG GQQGTCCCQG -paGTTCGTCG TATQQGCGCT CrTQQSGTTG 60 

ACXTOSaS S TCCTGSTQQC GTU3GCQGCT CCGAGTTCCC CCQGCACGCC TQQQGTCGCG 12D 

55 GCCQCGACCC AGGCGSCGAA CQQQQGACCT GCCACTCCQG CQCCQCCCQC CCCTOSCCXC 180 

GCCCCAACQG GGGACACGAA ACCGAAQAAG AACAAAAAAC CGAAAAACCC ACCGCCGCCG 240 

CGCCCCGCCG GCGACAMSC GACCGTCGCC GCGSGCCACG CCACCCTGCG CGAQCACCTG 300 

CQQGACATCA AGQCGGAGAA CACCGATGCA AATTTTTACG TGTQCCCACC CaCACGBGC 360 

GCCACGGTQS TGCAGTTOGA GCAGCCGCGC CGCTGCCCGA CCCGGCCCGA GGSTCAGAAC 420 

60 TACACGGAQG GCATCQCGGT QL\U lUM GAGAACATCG CCCCGTACAA GTTCAAGXC 480 

ACCATCTAa ACAAAGACGT CACCCTTTCG CAGSTCTQGT TCGGCCACCG aACTCCCAG 540 

TTTATOQQGA TCTTTGAQGA CCGCGCCCCC GTCCCCTTC6 AQGAGGTGAT CGACAAGATC 600 

AACGCCAAOG Ujb l L I UICG GTCCACQGCC AAGTACGTGC GCAM^WXT GGAGACCACC 660 
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GCGnTTCACC GQGACGACCA CGAGftCCGAC 
CGCACGMSCC QGQGCTQQCA CACCACCGAC 
TTCCACXGST ACQQGACGAC GGTAAACTGC 
TACCOSTACG ACGMilllbl QCTQGCGAa 
5 QGCTACCGQG AGQQGfTCGCA CACCGAACAC 
SnGACQGCT TaACGCQOS CGAfXTCACC 
CGGAACCTQC TCACGACCCC CAAGTTCACC 
TCG3TCTGCA CCATCACCAA GTOQCAQGAG 
GGCTCCTTCC GATTCTCCTC CGACQCCATA 

10 TACCCGOrT CQCGCGTOGA CCTQQGQGAC 
GACCGCATCT TCQCCCQCAG GTACAAOjCG 
TACCTQGCCA ATGQQaQCTT TCTGATCQCG 
GAGCTGTACG TQCQGGAACA CCTCCGAGAG 
CCGCCGCCCG QQGCCAGCGC CAACGCGTCC 

15 GAGTTCQCCC QGOQCAGTT TACGTACAAC 
QGCCGCGTTG CCATCGCGTC GTCCGAGCTE 
GCCCGCAAQC TGAACCCCAA CGCC ATCGC C 
CQGATQCTCG QCGACGTCAT QGCCGirrCC 
A7CGTCCAAA ACTCGATGCG CATCAGCTO 

20 CTCAQCTTTC QtjTACGAAfiA CCAfitSGCCCG 
GA3CTGCGGC TBACGCGCGA TCCGATCWG 
ACCTTCQGTG QQQGCTACGT GTACTTCGA6 
GCCGACATCA CCACCGTCAG CAOnTCATC 
GAGTTnnCC CCCTOGAQGT GTACACCCGC 

25 TACAOjGAGG TCCAGCGCCG CAACCAGCTG 
GTCATCCACG CCGACGCCAA CGCCGCCATC 
ATQQQCGACC TQQGG^C^ GGTCQQCAAG 
TCQGCCGTGT CQQGCGTCTC CTCCTTCATG 
C7OTTGGTCC TQGCCQQCa QQMQCQOa 

30 CAGAQCAACC CCATGAAQQC CCTCTACCCG 
AACCCQGACG CGTCCGQ3GA QQGC&AGGAG 
GAQGCCCGGG AGATGATACG GTACATGGCC 
MCCOJM A&AAGQQCAC GAGCGCGCT5 
CGCAAGCGCC GCAACACCAA aACAOXAA 

35 GACGACCTGT GAC 



ATQSAGCTCA AACCGQCCAA CGCCGCGfiCC 720 

CTCAAGTACA ACCCCTCGCG GGTGGAQQCG 780 

AWtGAGG AGSTGGACGC GCGCTtQGTG MO 

QQCGACTTTG TSTACATCTC CCCGTTrTAC 900 

ACCAQCTACG CCQCCGftCCG CTTCAAfiCAG 960 

ACCAAGGCa QGXCACGGC GCCGACCACC 1020 

GTGaCTQQG ACTQQGTGCC AAAGCGOTCG 1080 

GTQGACGAGA TGCTrGCGCTC CGAGTACGGC 1140 

TCCACCACa TCACCACCAA CCTGACCGW 1200 

TGCATCGSCA AQSACQCCCG CGACGCCATG 1260 

ACQCACATCA AQtjTOQQCCA GCCGCWTAC 1320 

TACCAGCCCC TTaCAQCAA CACGCTCGCG 1380 

CAGAQCCGCA AGCCCCCAAA CCCCACGCCC 1440 

GTOGAQCQCA TCAAGACCAC CTCCT CCATC 1500 

CACATACAGC QCCATGTCAA CGATATGTTG 1560 

CAGAATCACG AQCTCACCa GTQGAACGAG 1620 

TCQQCCACCG TGSGCCfiWTi GGTGAGCQCG 1680 

AOmSCGTCC CQtnCGCCGC GGACAACGTG 1740 

rty^yrcmr. CCTQCTACAG CCGCCCCCTG 1800 

TTOGTOGMaG GQCAQGTBGG GGAGAACAAC 1860 

CCGTGCACCG TQQGACACCG QCQCTACTTC 1920 

GACTAOjCGT ACTCCCACCA GCTGWCCGC 1980 

GACCTCAACA TCACCATGCT QGAGSATCAC 2040 

CAOMaATCA AGGAOCCGG CCTXTGGAC 2100 

CACGMXTOC GCrrCGCCGA CATCGACACG 2160 

TTCQCQQGCX TOOQCGCGH CTTCGAQQCSG 2220 

GTQG^TGATSG GCATCGTGQG CQGCGJIQGTA 2280 

TCCAACCCa TTQQQQCGCT QQCCGTOQGT 2340 

TTCTTCGCa TTCGaACGT CATE3CQQCTG 2400 

aAACCACCA AGGAGCTCAA UACCCCACC 2460 

GGCGGCGAa TTGAC&AQQC CAAGCTAGCC 2520 

CTQGJGTCTG CCATBiftGCG CAOSGAACAC 2580 

CTCAQCGCCA AGGTCACCGA CATQCTCATG 2640 

GTTCCCAACA AAaftCQGTGA CGCCGACaW; 2700 

2713 



SEQ ID N0:14: 




























Met Val Pro 


Asn 


Lys 


His 


Leu 


Leu 


Leu 


He 


He 


Leu 


Ser 


Phe 


Ser 


Ttir 


Ala Cys Gly 


Gin 


Thr 


Thr 


Pro 


Thr 


Thr 


Ala 


Val 


Glu 


Lys 


Asn 


Lys 


Thr 


Gin Ala He 


Tyr 


Gin 


Glu 


Tyr 


Phe 


Lys 


Tyr 


Arg 


va1 


Cys 


Ser 


Ala 


Ser 


Thr Thr Gly 


Glu 


Leu 


Phe 


Arg 


Phe 


Asp 


Leu 


Asp 


Arg 


Thr 


Cys 


Pro 


Ser 


Thr Glu Asp 


Lys 


Val 


His 


Lys 


Glu 


Gly 


He 


Leu 


Leu 


Val 


Tyr 


Lys 


Lys 


Asn He Val 


Pro 


Tyr 


lie 


Phe 


Lys 


Val 


Arg 


Arg 


Tyr 


Lys 


Lys 


He 


Thr 


Thr Ser Val 


Arg 


He 


Phe 


Asn 


Gly 


Trp 


Thr 


Arg 


Glu 


Gly 


Val 


Ala 


He 


Thr Asn Lys 


Trp 


Glu 


Leu 


Ser 


Arg 


Ala 


Val 


Pro 


Lys 


Tyr 


Glu 


He 


Asp 


He Met Asp 


Lys 


Thr 


Tyr 


Gin 


Cys 


His 


Asn 


Cys 


»tet 


Gin 


He 


Glu 


Val 


Asn Gly Met 


Leu 


Asn 


Ser 


Tyr 


Tyr 


Asp 


Arg 


Asp 


Gly 


Asn 


Asn 


Lys 


Thr 


Val Asp Leu 


Lys 


Pro 


Val 


Asp 


Gly 


Leu 


Thr 


Gly 


Ala 


He 


Thr 


Arg 


Tyr 


He Ser Gin 


Pro 


Lys 


Val 


Phe 


Ala 


Asp 


Pro 


Gly 


Trp 


Leu 


Trp 


Gly 


Thr 


lyr Arg Thr 


Arg 


Thr 


Thr 


Val 


Asn 


Cys 


Glu 


He 


Val 


Asp 


t^t 


Phe 


Ala 


Arg Ser Ala 


Asp 


Pro 


Tyr 


Thr 


Tyr 


Phe 


Val 


Thr 


Ala 


Leu 


Gly 


Asp 


Thr 


Val G1u Va1 


Ser 


Pro 


Phe 


Cys 


Asp 


Val 


Asp 


Asn 


Ser 


Cys 


Pro 


Asn 


Ala 


Thr Asp Val 


Leu 


Ser 


Val 


Gin 


He 


Asp 


Leu 


Asn 


His 


Thr 


Val 


Val 


Asp 


Tyr Gly Asn 


Ar^ 


Ala 


Thr 


Ser 


Gin 


Gin 


His 


Lys 


Lys 


Arg 


He 


Phe 


Ala 


His Thr Leu 


Asp 


Tyr 


Ser 


Val 


Ser 


Trp 


Glu 


Ala 


Val 


Asn 


Lys 


Ser 


Ala 


Ser Val Cys 


Ser 


Met 


Val 


Phe 


Trp 


Lys 


Ser 


Phe 


Gin 


Arg 


Ala 


He 


Gin 


Thr Glu His 


Asp 


Leu 


Thr 


Tyr 


His 


Phe 


lie 


Ala 


Asn 


Glu 


He 


Thr 


Ala 


Gly Phe Ser 


Thr 


Val 


Lys 


Glu 


Pro 


Leu 


Ala 


Asn 


Phe 


Thr 


Ser 


Asp 


Tyr 


Asn Cys Leu 


Met 


Thr 


His 


lie 


Asn 


Thr 


Tlir 


Leu 


Glu 


Asp 


Lys 


He 


Ala 


Arg Val Asn 


Asn 


Tlir 


His 


Thr 


Pro 


Asn 


Gly 


Thr 


Ala 


Glu 


Tyr 


Tyr 


Gin 


Thr Glu Gly 


Gly 


Met 


lie 


Leu 


Val 


Trp 


Gin 


Pro 


Leu 


He 


Ala 


He 


Glu 


Leu Glu Glu 


Ala 


Met 


Leu 


Glu 


Ala 


Thr 


Thr 


Ser 


Pro 


Val 


Thr 


Pro 


Ser 


Ala Pro Thr 


Ser 


Ser 


Ser 


Arg 


Ser 


Lys 


Arg 


Ala 


He 


Arg 


Ser 


He 


Arg 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



SEC 



Asp Val 
Tyr Ala 
Leu Ala 
Tyr Glu 
Gly Lys 
Glu Cys 
Lys Thr 
Lys Phe 
Asn Glu 
Ala Glu 
Tyr Val 
T>>r Phe 
Ala Val 
Asp Leu 
Ser Gly 
Arg He 
Gly Lys 
Gly He 
Met Ptw 
Leu Va1 
Tyr Pro 
Glu Thr 
Ala Tyr 
Arg Gin 
He Ser 

ID N0:15: 
Met Tyr 
Phe Asn 
Ser Thr 
Thr Ser 
Ser Lys 
Thr Val 
Cys Ser 
Gin Thr 
Leu Val 
Tyr Arg 
Ala Ala 
Glu He 
Thr Val 
Asn Lys 
Asn Arg 
Pro Gly 
Met Tyr 
Gly Asp 
Thr Thr 
Glu Asn 
Asp Lys 
Ala Glu 
Phe Pro 
Ala Asn 
Glu Leu 
Asn Glu 
Arg Asp 
He Trp 
Thr He 
Ser Val 
Gin Val 
Leu Glu 
Leu Met 
Ala He 
Met Val 
Lys Ser 



Ser Ala 
Tyr Asp 
lie Thr 
He Ala 
Pro Val 
He Asn 
Glu Asn 
Val Asn 
He Leu 
Thr Phe 
H15 Val 
Leu Ala 
Glu Leu 
Glu Thr 
Leu Arg 
He Gin 
Val He 
Val Thr 
Thr Phe 
Arg Arg 
Asp Val 
He Lys 
Lys Lys 
Ala Ala 
Thr Glu 



Gly Ser 
Lys Leu 
Trp Cys 
Lys He 
Ser Arg 
val Asp 
Asn Asp 
Ser Ser 
Leu Ser 
Phe Thr 
Glu Thr 
Leu Asn 
Tyr Ser 
Met Phe 
Lys Asp 
Asp Phe 
Val Asn 
Gly He 
Leu Leu 
Thr Asn 
Glu Lys 
Gin He 
Lys Glu 
Glu Thr 
Asp Lys 



G1u Asn Asn 
Arg Gin Ser 
Arg Glu Gin 
Asn Pro Thr 
Lys Ala Leu 
Gin Ser Ser 
He Cys Tyr 
Gin Leu Phe 
Glu Ser Leu 
Ala Lys Asn 
Leu Pro Val 
Leu Thr Phe 
Ser Gly Glu 
Arg Glu Tyr 
Phe Asp Asn 
Ser Glu He 
Val Ala Ser 
Leu Asn Phe 
He Gly Ala 
Asn Met Ser 
Ser Lys Ser 
Leu Leu Gly 
Glu Gin Arg 
Phe Leu Arg 
He Val 



Val Phe Leu 
He Asn Asn 
Val Arg Gin 
Ser Val Met 
Gly Asp Val 
Val Ser He 
Ser Arg Pro 
Lys Gly Gin 
Val Glu Asn 
Glu Thr Tyr 
Asn Asn He 
He Glu Asn 
Arg Lys Leu 
Asn Tyr Tyr 
Ser Gin Arg 
Leu Ala Asp 
Gly Ala Phe 
He Lys Asn 
Val He He 
Gin Ala Pro 
Thr Val Thr 
Met His Asn 
Ala Ala Arg 
Lys Arg Ser 



Ser Gin 
Val Leu 
Thr Met 
Thr Ala 
He Ser 
His Lys 
Pro Val 
Leu Gly 
Cys His 
His Phe 
Ser Thr 
He Asp 
Ala Asn 
Ala Gin 
Asn Asn 
Leu Gly 
Ser Leu 
Pro Leu 
Leu Val 
He Arg 
Pro Met 
Met Gin 
Pro Ser 
Gly Tyr 



He Gin 
Glu Glu 
Val Trp 
He Tyr 
Val Thr 
Ser Leu 
Thr Phe 
Ala Arg 
Gin Asn 
Lys Asn 
Leu Asp 
Phe Lys 
Val Phe 
Ser He 
Arg Asp 
Ser He 
Phe Gly 
Gly Gly 
He Leu 
>tet He 
Glu Pro 
Gin Glu 
He Phe 
Lys Gin 



Tyr Lys 
Gin Thr 
Thr Ser 
Thr Asn 
Pro Ser 
Thr Asp 
Ala Ser 
Cys Pro 
Leu Lys 
Lys He 
Val Thr 
Ser Met 
He Asn 
Ser Val 
Tyr Phe 
He Tyr 
Ala Arg 
Thr He 
Gly Asn 
His Gin 
Arg He 
Asp Arg 
Arg Ala 
Glu Val 
Arg Gly 
Thr Leu 
Gly Glu 
Gin Ala 
Glu Arg 
Asp Thr 
Gin Tyr 
Glu Leu 
Trp Tyr 
Tyr Gly 
Ser Glu 
Met Arg 



Thr He 
Thr Thr 
Ser Thr 
Ser Ser 
Thr Asp 
Asp Thr 
Ser Ser 
Asp Thr 
Lys Asn 
Ala Thr 
Asn Arg 
He Asp 
Gly He 
Pro Leu 
Ser Gin 
Arg Val 
Ser Val 
Glu He 
Ser Thr 
Thr Val 
Phe Leu 
Glu Arg 
He Gin 
Thr Ala 
Asn Thr 
Glu Glu 
Val Lys 
Met Lys 
Asn Asn 
Lys Thr 
Ala Tyr 
Thr Lys 
Glu Leu 
Lys Pro 
Cys He 
Thr Asp 



Leu Phe 
His Ser 
Thr Thr 
Leu Ala 
Asn Gin 
Ala Ser 
Ser Gly 
Lys Asp 
He Val 
Ser Val 
Asp Asp 
Arg Thr 
Leu Asn 
Gin Pro 
Pro Leu 
Arg Thr 
Glu Pro 
Ser Pro 
Ser Arg 
Asp Tyr 
Lys Asp 
Ala He 
Thr He 
Ser Phe 
Glu He 
Thr Val 
Tyr Tyr 
Pro Leu 
Lys Thr 
Phe Gin 
Asp His 
Thr Trp 
Ser Lys 
Val Ala 
Asn Val 
Asp Pro 



Phe Ala 

Thr Thr 
Ser Thr 
Ala Ser 
Gly Thr 
Lys Asn 
Glu Leu 
Lys Lys 
Pro Tyr 
Thr Val 
He Ser 
Tyr His 
Thr Tyr 
Val Ala 
He Tyr 
Thr Val 
Tyr Thr 
Phe Cys 
Asp Ala 
Glu Arg 
Glu Glu 
Cys Asp 
His Asn 
Leu Thr 
Leu Asn 
Lys Lys 
Lys Thr 
Asn Leu 
Gly Asn 
Gly Ala 
Leu Arg 
Cys Arg 
He Asn 
Val Lys 
Asp Gin 
Lys Val 



Leu He 
Thr Ser 
Ser Lys 
Pro Gin 
Ser Thr 
Phe Tyr 
Phe Arg 
His Val 
He Phe 
Tyr Arg 
Arg Ala 
Cys Phe 
He Asp 
Gly Leu 
Ala Glu 
Asn Cys 
His Phe 
His Asn 
Thr Lys 
Arg Gly 
Tyr Thr 
Phe Val 
Glu Ser 
Ser Asn 
Cys Met 
Phe Asn 
Asn Gly 
Ser Glu 
Lys Ser 
Lys Gly 
Thr Ser 
Glu Gin 
Pro Val 
Ala Met 
Ala Ser 
Cys Tyr 

9(. 



Lys Val 
Pro Ser 
Pro Ser 
Asn Thr 
Pro Thr 
Lys Tyr 
Phe Asp 
Glu Gly 
Lys Val 
Gly Trp 
He Pro 
Ser Ale 
Arg Asp 
Thr Glu 
Pro Gly 
Glu Val 
He Thr 
Asn Ser 
Val Trp 
His Pro 
He Ser 
He Trp 
Phe His 
Gin Glu 
Asn Ser 
Lys Ser 
Gly Leu 
His Thr 
Arg Gin 
Leu Ser 
Met Asn 
Lys Lys 
Ser Val 
Gly Asp 
Val Asn 
Ser Arg 



Cys Ser 
He Ser 
Asn Thr 
Ser Thr 
He Pro 
Arg Val 
Leu Asp 
He Leu 
Arg Lys 
Ser Gin 
Tyr Asn 
Met Ala 
Ser Glu 
Asn He 
Trp Phe 
Val Asp 
Ala Leu 
Gin Cys 
He Glu 
Thr Lys 
Trp Lys 
Lys Thr 
Phe Val 
Glu Thr 
Thr He 
His He 
Phe Leu 
Asn Tyr 
Lys Arg 
Thr Ala 
His He 
Asp Asn 
Met Ala 
Ala Phe 
He His 
Pro Leu 
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Val 


Thr 


Phe 


Lys 


Phe 


Val 


Asn 


Ser 


Thr 


Ala 


Thr 


Pre Arg Gly bin Leu 


Gly 


Thr 


Arg 


Asn 


Glu 


He 


Leu 


Leu 


Thr 


Asn 


Thr 


His Val Giu Thr cys 


Arg 


Pro 


Thr 


Ala 


Asp 


His 


Tyr 


Phe 


Phe 


Val 


Lys 


Asn net ihr His lyr 


Phe 


Lys 


Asp 


Tyr 


Lys 


Phe 


Val 


Lys 


Thr 


Met 


Asp 


Thr Asn Asn He Ser 


Thr 


Leu 


Asp 


Thr 


Phe 


Leu 


Thr 


Leu 


Asn 


Leu 


Thr 


Phe He Asp Asn He 


Asp 


Phe 


Lys 


Thr 


Val 


Glu 


Leu 


Tyr 


Ser 


Glu 


Thr 


Glu Arg Lys net Aia 


Ser 


Ala 


Leu 


Asp 


Leu 


Glu 


Thr 


Met 


Phe 


Arg 


Glu 


Tyr Asn Tyr Tyr Thr 


Gin 


Lys 


Leu 


Ala 


Ser 


Leu 


Arg 


Glu 


Asp 


Leu 


Asp 


Asn Thr He Asp Leu 


Asn 


Arg 


Asp 


Arg 


Leu 


Val 


Lys 


Asp 


Leu 


Ser 


Glu 


Met net Ala Asp Leu 


61 y 


Asp 


He 


Gly 


Lys 


Val 


Val 


Val 


Asn 


Thr 


Phe 


T^jta tl*^ 

Ser Gly He Val Thr 


Val 


Phe 


Gly 


Ser 


He 


Val 


Gly 


Gly 


Phe 


Val 


Ser 


Phe Phe Thr Asn Pro 


He 


Gly 


Gly 


val 


Thr 


lie 


He 


Leu 


Leu 


Leu 


He 


Val Val Val Phe Val 


Vdl 


Phe 


He 


Val 


Ser 


Arg 


Arg 


Thr 


Asn 


Asn 


Het 


Asn Glu Ala Pro He 


Lys 


Met 


He 


Tyr 


Pro 


Asn 


He 


Asp 


Lys 


Ala 


Cam 

ber 


u 1 u u 1 n o 1 u Asn 1 1 e 


Gin 


Pro 


Leu 


Pro 


Gly 


Glu 


G1u 


He 


Lys 


Arg 


lie 


Leu Leu biy net nis 


Gin 


Leu 


Gin 


Gin 


Ser 


G1u 


His 


Gly 


Lys 


Cam 

Ser 


Glu 


blu ulu Ala xr His 


Lys 


Pro 


Gly 


Leu 


Phe 


Gin 


Leu 


Leu 


Gly 


Asp 


Gly 


Leu Gin Leu Leu Arg 


Arg 


Arg 


Gly 


Tyr 


Thr 


Arg 


Leu 


Pro 


Thr 


rhe 


ASp 


rPO xr rrO biy ASrl 


Asp 


Thr 


Ser 


Glu 


Thr 


His 


Gin 


Lys 


Tyr 


vai 






SEQ 10 N0:16; 






















Met 


Gly 


Val 


Gly 


Gly 


Gly 


Pro 


Arg 


Val 


Vai 


Leu 


Cys Leu Trp cys vai 


Ala 


Ala 


Leu 


Leu 


Cys 


Gin 


Gly 


Val 


Ala 


Gin 


Glu 


lt*.l Mm.^ A1« 

Val Val Ala Glu Thr 


Thr 


Thr 


Pro 


Phe 


Ala 


Thr 


His 


Arg 


Pro 


Glu 


Val 


Val Ala ulu Glu Asn 


Pro 


Ala 


Asn 


Pro 


Phe 


Leu 


Pro 


Phe 


Arg 


Val 


Cys 


Gly Ala Ser Pro Thr 


fil V 


Gl V 


Glu 


He 


Phe 


Arg 


Phe 


Pro 


Leu 


Glu 


G1u 


Ser Cys Pro Asn Thr 


Klu 

U I u 




Lys 


Asp 


His 


He 


Glu 


Gly 


He 


Ala 


Leu 


He Tyr Lys Thr Asn 


1 1 c 


TO 1 


Pro 


Tyr 


Val 


Phe 


Asn 


Val 


Arg 


Lys 


Tyr 


Arg Lys He Het Thr 


ier 


Thr 

1 nr 


Thr 


He 


Tyr 


Lys 


Gly 


Trp 


Scr 


Glu 


Asp 


Ala He Thr Asn Gin 


niS 


1 nr 


Arg 


Ser 


Tyr 


Ala 


Val 


Pro 


Leu 


Tyr 


Glu 


Val Gin Het Met Asp 


HIS 


Tyr 


Tyr 


Gin 


Cys 


Phe 


Ser 


Ala 


Val 


Gin 


Val 


Asn Glu Gly Gly His 


va 1 




Thr 


Tyr 


Tyr 


Asp 


Arg 


Asp 


Gly 


Trp 


Asn 


Glu Thr Ala Phe Leu 


Lys 


Kro 


Ala 


Asp 


Gly 


Leu 


Thr 


Ser 


Ser 


He 


Thr 


Arg Tyr Gin Ser Gin 


Pro 


Glu 


Val 


Tyr 


Ala 


Thr 


Pro 


Arg 


Asn 


Leu 


Leu 


Trp Ser Tyr Thr Thr 


Arg 


Thr 


Thr 


Val 


Asn 


Cys 


Glu 


Val 


Thr 


Glu 


Met 


Ser Ala Arg Ser Met 


Lys 


Pro 


Phe 


Glu 


Phe 


Phe 


Val 


Thr 


Ser 


Val 


Gly 


Asp Thr He Glu Het 


Ser 


Pro 


Phe 


Leu 


Lys 


Glu 


Asn 


Gly 


Thr 


Glu 


Pro 


Glu Lys He Leu Lys 


Arg 


Pro 


His 


Ser 


He 


Gin 


Leu 


Leu 


Lys 


Asn 


Tyr 


Ala Val Thr Lys Tyr 


Gly 


Val 


Gly 


Leu 


Gly 


Gin 


Ala 


Asp 


Asn 


Ala 


Thr 


Arg Phe Phe Ala He 


Phe 


Gly 


Asp 


Tyr 


Ser 


Leu 


Ser 


Trp 


Lys 


Ala 


Thr 


Thr Glu Asn Ser Ser 


Tyr 


Cys 


Asp 


Leu 


He 


Leu 


Trp 


Lys 


Gly 


Phe 


Ser 


Asn Ala He Gin Thr 


Gin 


His 


Asn 


Ser 


Ser 


Leu 


His 


Phe 


He 


Ala 


Asn 


Asp He Thr Ala Ser 


Phe 


Scr 


Thr 


Pro 


Leu 


Glu 


Glu 


Glu 


Ala 


Asn 


Phe 


Asn Glu Thr Phe Lys 


Cys 


He 


Trp 


Asn 


Asn 


Thr 


Gin 


Glu 


Glu 


He 


Gin 


Lys Lys Leu Lys Glu 


Val 


Glu 


Lys 


Thr 


His 


Arg 


Pro 


Asn 


Gly 


Thr 


Ala 


Lys Val Tyr Lys Thr 


Thr 


Gly 


Asn 


Leu 


Tyr 


He 


Val 


Trp 


Gin 


Pro 


Leu 


He Gin He Asp Leu 


Leu 


Asp 


Thr 


His 


Ala 


Lys 


Leu 


Tyr 


Asn 


Leu 


Thr 


Asn Ala Thr Ala Ser 


Pro 


Thr 


Ser 


Thr 


Pro 


Thr 


Thr 


Ser 


Pro 


Arg 


Arg 


Arg Arg Arg Asp Thr 


Scr 


Ser 


Val 


Ser 


Gly 


Gly 


Gly 


Asn 


Asn 


Gly 


Asp 


Asn Ser Thr Lys Glu 


G1u 


Ser 


Val 


Ala 


Ala 


Ser 


Gin 


Val 


Gin 


Phe 


Ala 


Tyr Asp Asn Leu Arg 


Lys 


Ser 


lie 


Asn 


Arg 


Val 


Leu 


Gly 


Glu 


Leu 


Ser 


Arg Ala Trp Cys Arg 


Glu 


Gin 


Tyr 


Arg 


Ala 


Ser 


Leu 


Met 


Trp 


Tyr 


Glu 


Leu Ser Lys He Asr 


Pro 


Thr 


Ser 


Val 


Het 


Ser 


Ala 


He 


Tyr 


Gly 


Arg 


Pro Val Ser Ala Lys 


Leu 


He 


Gly 


Asp 


Val 


Val 


Ser 


Val 


Ser 


Asp 


Cys 


He Ser Val Asp Gin 


Lys 


Ser 


Val 


Phe 


Val 


His 


Lys 


Asn 


t^t 


Lys 


Val 


Pro Gly Lys Glu Asp 


Leu 


Cys 


Tyr 


Thr 


Arg 


Pro 


Val 


Val 


Gly 


Phe 


Lys 


Phe He Asn Gly Ser 


Glu 


Leu 


Phe 


Al8 


Gly 


Gin 


Leu 


Gly 


Pro 


Arg 


Asn 


Glu He Val Leu Ser 


Thr 


Ser 


Gin 


Val 


Glu 


Val 


Cys 


Gin 


His 


Ser 


Cys 


Glu His Tyr Phe Gin 


A1a 


Gly 


Asn 


Gin 


Het 


Tyr 


Lys 


Tyr 


Lys 


Asp 


Tyr 


Tyr Tyr Val Ser Thr 


Leu 


Asn 


Leu 


Thr 


Asp 


He 


Pro 


Thr 


Leu 


His 


Thr 


Met He Thr Leu Asn 


Leu 


Scr 


Leu 


Val 


Glu 


Asn 


He 


Asp 


Phe 


Lys 


Val 


He Glu Leu Tyr Ser 


Lys 


Thr 


Glu 


Lys 


Arg 


Leu 


Ser 


Asn 


Val 


Phe 


Asp 


He Glu Thr Met Phe 


Arg 


Glu 


Tyr 


Asn 


Tyr 


Tyr 


Thr 


Gin 


Asn 


Leu 


Asn 


Gly Leu Arg Lys Asp 
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Leu Asp Asp Ser He Asp His Gly Arg Asp Sep Phe He Gin Thr Leu 
Gly Asp lie Met G1n Asp Leu G1y Thr He Gly Lys Val Val Val Asn 
Val Ala Ser Gly Val Phe Ser Leu Phe Gly Ser He Val Ser Gly Val 
He Ser Phe Phe Lys Asn Pro Phe Gly Gly Het Leu Leu He Val Leu 
5 He He Ala Gly Val Val Val Val Tyr Leu Phe Met Thr Arg Ser Arg 

Ser He Tyr Ser Ala Pro He Arg Met Leu Tyr Pro Gly Val Glu Arg 
Ala Ala Gin Glu Pro Gly A1a His Pro Val Ser Glu Asp Gin He Arg 
Asn lie Leu Met Gly Met His Gin Phe Gin Gin Arg Gin Arg Ala Glu 
Glu Glu Ala Arg Arg Glu Glu Glu Val Lys Gly Lys Arg Thr Leu Phe 
10 Glu Val He Arg Asp Ser Ala Thr Ser Val Leu Arg Arg Arg Arg Gly 

Gly Gly Gly Tyr Gin Arg Leu Gin Arg Asp Gly Ser Asp Asp Glu Gly 
Asp Tyr Glu Pro Leu Arg Arg Gin Asp Gly Gly Tyr Asp Asp Val Asp 
Val Glu Ala Gly Thr Ala Asp Thr Gly Val 



15 


SEQ ID HO: 17 








Hot 


Tvr 
lyr 


Pro 


Thr Val L vs Ser 




t Ml 

Leu 


Thr 

1 nr 




bCU LpCU 




Prn 


Thr 

1 III 


Thr 

1 III 


t Ml Am (m\f\ Pm 

LiCU rV y U 1 II rl u 




Prn 


Thr 

1 ill 


u t u 


Thr Pro Pro Pro 

fill 1 f V rtv r f V 




u ■ u 


Tvr 
lyr 


r 1 nr 


Ara VaI Tvc Rlv 




Ph* 


MSp 


I Ml 
LCU 


Asn t vs Thr Cv5 




UlU 


ft! V 


Tip 


1 mi 1 Ml Vxl Tvr 

LC\J L.CU T Q 1 1 Jn 




Lys 


I le 




Am Twip ArA 1 vc 
wry 1 jTi ftry ljta 




Gl V 
\3iy 




Thr 


Thr UaI 




Al 8 


Ml 5 


Glu 

U 1 u 


Trn Glu Val Glv 

lip UlU Vfll Ulj 




Asn 


Ser 


Aia 


Tkiv Ua^ UkI 

\ nr nex va i va i 




Arg 


Asp 


Gly 


vai Asn Lys inr 




Thr 


Gly 


Asn 


He Gin Arg Tyr 




Pro 


Gly 


Trp 


net rro uiy rne 


'9n 
3U 


ulu 


lie 


val 


Am Ua» Usi AIk 

Asp net vai Aia 




Ala 


Thr 


Ala 


Leu u 1 y Asp jer 




Asp 


Asn 


1 nr 


ser um Lys inr 




Arg 


Glu 


vai 


Lys Asn Tyr Lys 




Aim 

Ala 


Pro 


Ala 


bty um ber Arg 


oo 


Tyr 


Ser 


Trp 


1 Tk*> Als l>r 

Lys 1 nr A 1 a inr 




HI S 


Trp 


Lys 


Dbuk Aw*fm 

inr rne rro Arg 




Tyr 


His 


rne 


Uil A1 s Acn i^lii 

V9 1 Aia Asn ulu 




Thr 


G1u 


va t 


r'lii A» OkA T^o 

blu Asn rne inr 




lie 


Asn 


Lys 


Tnr lie ser biu 




Val 


Ala 


Cab 

ser 


Gly Lys Thr bin 




Leu 


I le 


Trp 


Gin Pro Leu Glu 




Asp 


Ser 


Asp 


rro uiu rio inr 




Lys 


Arg 


Glu 


Ala Ala Asp Asn 




Gly 


Ser 


Glu 


Asn Pro Leu He 


45 


Lys 


Leu 


Thr 


Thr Ser Val Asn 




Trp 


Cys 


Arg 


Glu Gin Val Arg 




Lys 


Val 


Asn 


Pro Thr Ser Val 




Ala 


A1a 


Arg 


Tyr Val Gly Asp 




Val 


Asp 


Gin 


Ser Ser Val Asn 


50 


Asp 


Lys 


Thr 


Thr Cys Tyr Ser 




Asn 


Ser 


Thr 


Asp Pro Leu Thr 




He 


Leu 


Ser 


Asn Thr Asn He 




Tyr 


Phe 


He 


Val Gly Glu Tyr 




Glu 


Glu 


Lys 


Leu Asn Leu Ser 


55 


Ala 


Leu 


Asn 


He Ser Phe He 




Leu 


Tyr 


Ser 


Ser Thr Glu Arg 




G1u 


Ser 


Het 


Phe Arg Glu Tyr 




He 


Lys 


Lys 


Asp Leu Asp Asn 




Val 


Gin 


Asp 


Leu Ser Asp Het 


60 


Ser 


Val 


Va1 


Asn Val Val Ser 




Val 


Thr 


Gly 


Phe He Lys Phe 




lie 


Leu 


Leu 


He He Gly Gly 




Arg 


Arg 


Asn 


Ser Gin Phe His 



Met Arg Val Ala His Leu Thr Asn Leu 
Cys His Thr His Leu Tyr Val Cys Gin 
Ser Asp ftet Thr Pro Ala Gin Asp Ala 
Leu Ser Thr Asn Thr Asn Arg Gly Phe 
Val Ala Ala Thr Gly Glu Thr Phe Arg 
Pro Ser Thr Gin Asp Lys Lys His Val 
Lys He Asn He Val Pro Tyr He Phe 
He He Thr Gin Leu Thr He Trp Arg 
Thr Gly Lys Phe Glu Met Ala Thr Gin 
Asp Phe Asp Ser He Tyr Gin Cys Tyr 
Asn Asn Val Arg Gin Val Tyr Val Asp 
Val Asn He Arg Pro Val Asp Gly Leu 
Phe Ser Gin Pro Thr Leu Tyr Ser Glu 
Tyr Arg Val Arg Thr Thr Val Asn Cys 
Arg Ser Met Asp Pro Tyr Asn Tyr He 
Leu Glu Leu Ser Pro Phe Gin Thr Phe 
Ala Pro Lys Arg Ala Asp Met Arg Val 
Phe Val Asp Tyr Asn Asn Arg Gly Thr 
Thr Phe Leu Glu Thr Pro Ser Ala Thr 
Arg Gin Thr Als Thr Cys Asp Leu Val 
Ala He Gin Thr Ala His Glu His Ser 
Val Thr Ala Thr Phe Asn Thr Pro Leu 
Ser Thr Tyr Ser Cys Val Ser Asp Gin 
Tyr He Gin Lys Leu Asn Asn Ser Tyr 
Tyr Phe Lys Thr Asp Gl y Asn Leu Tyr 
His Pro Glu He Glu Asp He Asp Glu 
Pro Ala Pro Pro Lys Ser Thr Arg Arg 
Gly Asn Ser Thr Ser Glu Val Ser Lys 
Thr Ala Gin He Gin Phe Ala Tyr Asp 
Asn Val Leu Glu Glu Leu Ser Arg Ala 
Asp Thr Leu Het Trp Tyr Glu Leu Ser 
Met Ser Ala He Tyr Gly Lys Pro Val 
Ala He Ser Val Thr Asp Cys He Tyr 
He His Gin Ser Leu Arg Leu Gin His 
Arg Pro Arg Val Thr Phe Lys Phe He 
Gly Gin Leu Gly Pro Arg Lys Glu He 
Glu Thr Cys Lys Asp Glu Ser Glu His 
He Tyr Tyr Tyr Lys Asn Tyr He Phe 
Ser He Ala Thr Leu Asp Thr Phe He 
Glu Asn He Asp Phe Lys Thr Val Glu 
Lys Leu Ala Ser Ser Val Phe Asp He 
Asn Tyr Tyr Thr Tyr Ser Leu Ala Gly 
Thr He Asp Tyr Asn Arg Asp Arg Leu 
»tet Ala Asp Leu Gly Asp He Gly Arg 
Ser Val Val Thr Phe Phe Ser Ser He 
Phe Thr Asn Pro Leu Gly Gly He Phe 
He He Phe Leu Val Val Val Leu Asn 
Asp Ala Pro He Lys Met Leu Tyr Pro 



wo 97/12042 



PCT/l)S96/l5702 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Leu 

SEQ ID HO: 18: 
Met Thr 
Ala Cys 
Thr Thr 
Arg Va) 
Asp He 
Leu Leu 
Arg Ser 
Tyr Ala 
Ser Tyr 
Val Lys 
Gly Val 
Gly Val 
Trp Leu 
Thr Asp 
Thr Thr 
Lys Glu 
Tyr Lys 
Arg Arg 
Glu Asn 
Ser Thr 
Glu Gly 
Pro Asp 
Glu Lys 
He Thr 
Leu Thr 
Thr Pro 
Ala Ala 
Asp Ala 
Ser Leu 
Tyr Asp 
Arg Ala 
Leu Thr 
Ala Val 
Val Pro 
Pro Gly 
Phe He 
Glu He 
Gin Tyr 
His His 
Phe He 
Leu Glu 
Leu Glu 
Gly Leu 
Phe Val 
Gin Ser 
Leu Val 
Leu He 
Thr Arg 
Pro Gly 
Gly He 
Leu His 
Gly Pro 
Pro Gly 
Leu Leu 



Arg Arg Arg 
Arg Leu Gly 
Val Gin Pro 
Cys Glu Leu 
Gin Cys Pro 
Met Val Phe 
Tyr Thr Lys 
Asp Ser Val 
Glu Thr Asp 
Met Thr Lys 
Asn He Thr 
Arg Arg Tyr 
He Trp Thr 
Met Met Ala 
Gly Gin Thr 
Thr Phe His 
He Val Asp 
Ala Phe Leu 
Arg Thr Ala 
He Ala Thr 
Thr Ser Ser 
Ala Phe Lys 
Tyr Glu Ala 
Tyr Phe He 
Pro Arg Ser 
Thr Ser Ser 
Arg Gly Ser 
Gly Asn Ala 
Gly Thr Leu 
Ser Leu Arg 
Trp Cys Leu 
Lys He Asn 
Ala Ala Lys 
Val Asn Gin 
Ser Glu Thr 
Asn Asp Thr 
Phe Leu Thr 
Tyr Phe Gin 
Phe Lys Thr 
Ser Leu Asn 
Leu Tyr Ser 
Gly He Phe 
Arg Lys Asp 
Asp Gly Leu 
He Thr Asn 
Ser Gly Phe 
Leu Val Leu 
Arg Thr Arg 
He Asp Glu 
Asn Pro He 
Glu Gin Asn 
Ser Val Ala 
Leu Arg Arg 
Gly Glu Ala 



Val Leu 
Ala Gin 
Thr Ala 
Ser Ser 
Ser Phe 
Lys Asp 
He Val 
Thr Asn 
Gin Met 
Asp Gly 
Val Asn 
Ala Ser 
Tyr Arg 
Lys Ser 
Val Glu 
Glu Arg 
Tyr Asp 
Asp Lys 
Tyr Cys 
Glu Thr 
Phe Val 
Cys He 
Val Gin 
Thr Ser 
Leu Ala 
Pro Pro 
Thr Pro 
Thr Thr 
Asn Asn 
Arg Gin 
Glu Gin 
Pro Thr 
Arg Leu 
Ala Thr 
Met Cys 
Lys Thr 
Lys Lys 
Ser Gly 
He Glu 
Thr Ser 
Arg Asp 
Arg Glu 
Leu Asp 
Gly Glu 
Leu Val 
He Ser 
Val Ala 
Gin Met 
Leu Ala 
Ser Lys 
Gin Glu 
Ser Arg 
Arg Arg 
Glu Thr 



Ser Val 
Thr Pro 
Thr Arg 
His Gly 
Gly Thr 
Asn He 
Thr Asn 
Arg His 
Asp Thr 
Leu Thr 
Leu Lys 
Gin Thr 
Thr Arg 
Asn Ser 
Met Ser 
Ala Asp 
Asn Arg 
Gly Thr 
Pro Leu 
Gly Lys 
Thr Asn 
Glu Glu 
Asp Arg 
Gly Gly 
Thr Val 
Ser Ser 
Ala Ala 
Pro Val 
Pro Ala 
He Asn 
Lys Arg 
Thr Val 
Gly Asp 
Val Thr 
Tyr Ser 
Tyr Glu 
Met Thr 
Asn Glu 
Leu Asp 
Leu He 
Glu Gin 
Tyr Asn 
Asn Ala 
Leu Met 
Ser Thr 
Phe Phe 
Gly Val 
Ser Gin 
Gin Gin 
Thr Glu 
Gin Lys 
Ala Leu 
Tyr His 
Glu Phe 



Ala 


Pro 


Pro 


Pro 


Tyr 


Ser 


Ala 


He 


Lys 


Arg 


He 


Leu 


Leu 


Gly 


Glu 


Ala 


Gin 


Lys 


Gin 


Leu 


Thr 


Ala 


Thr 


Gly 


Phe 


Leu 


Arg 


Asn 


Leu 


Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Val 


Val 


Leu 


Leu 


Ala 


Ala 


Leu 


Glu 


Gin 


Pro 


Ala 


Pro 


Pro 


Ala 


Gin 


Gin 


Thr 


Ser 


Phe 


Pro 


Phe 


Asp 


Leu 


Phe 


Arg 


Phe 


Ser 


Ser 


Arg 


Glu 


Asn 


His 


1 nr 


ulU 


Gly 


He 


Pro 


Tyr 


Ser 


Phe 


Lys 


Val 


He 


Leu 


He 


Tyr 


Asn 


Gly 


Trp 


Glu 


Glu 


Lys 


Phe 


Ser 


Val 


Asp 


He 


Tyr 


Gin 


Cys 


Tyr 


Asn 


Ala 


Arg 


Val 


Tyr 


Val 


Asp 


Arg 


Asp 


Pro 


Thr 


Gly 


Gly 


Leu 


Ala 


Asn 


Glu 


Leu 


Tyr 


Asp 


Ala 


Pro 


Gly 


Thr 


Thr 


Val 


Asn 


Cys 


Leu 


He 


Pro 


Phe 


Asp 


Phe 


Phe 


Val 


Thr 


Pro 


Phe 


Tyr 


Asp 


Gly 


Lys 


Asn 


Ser 


Phe 


His 


Val 


Arg 


Thr 


Asn 


Gly 


Thr 


Asn 


Pro 


Gin 


Gly 


Glu 


Tyr 


Thr 


Leu 


Ser 


Trp 


Lys 


Leu 


Gin 


His 


Trp 


Gin 


Thr 


Phe 


Asp 


Ser 


He 


His 


Phe 


Val 


Thr 


Asp 


Thr 


Thr 


Val 


Gly 


He 


Glu 


Leu 


Gin 


Val 


Asn 


Lys 


Thr 


Met 


His 


Tyr 


Thr 


Lys 


Gly 


Gin 


Glu 


Ala 


Leu 


Leu 


Leu 


Ala 


Trp 


Leu 


Pro 


Lys 


Asn 


Leu 


Thr 


Glu 


Leu 


Thr 


Pro 


Ser 


Pro 


Pro 


Ala 


Pro 


Ser 


Val 


Leu 


Arg 


Arg 


Arg 


Arg 


Arg 


Pre 


Pro 


Thr 


Ala 


Pro 


Gly 


Lys 


Thr 


Val 


Gin 


He 


Gin 


Phe 


Ala 


Arg 


Met 


Leu 


Gly 


Asp 


Leu 


Ala 


Gin 


Asn 


Met 


Val 


Leu 


Arg 


Glu 


Met 


Ser 


Ser 


He 


Tyr 


Gly 


Lys 


Val 


He 


Ser 


Val 


Ser 


Gin 


Cys 


Leu 


Arg 


Lys 


Ser 


Met 


Arg 


Val 


Arg 


Pro 


Leu 


Val 


Ser 


Phe 


Ser 


Gly 


Gin 


Leu 


Gly 


Thr 


Asp 


Asn 


Glu 


val 


Cys 


Gin 


Ala 


Thr 


Ser 


He 


His 


Val 


Tyr 


Asn 


Asp 


Tyr 


Gly 


He 


Ala 


Thr 


Leu 


Gin 


Thr 


Glu 


Asn 


He 


Asp 


Phe 


Ala 


Ser 


Arg 


Ala 


Ser 


Asn 


Val 


Phe 


Asp 


Phe 


Gin 


Ala 


Gin 


Asn 


He 


Ala 


Val 


Ser 


Asn 


Gly 


Arg 


Asn 


Gin 


Asp 


Ser 


Leu 


Gly 


Ser 


Val 


Gly 


Val 


Gly 


Gly 


Leu 


Phe 


Ser 


Ser 


Lys 


Asn 


Pro 


Phe 


Gly 


Gly 


Met 


Val 


He 


Leu 


Val 


He 


Ser 


Leu 


Gin 


Pro 


Val 


Gin 


Met 


Leu 


Tyr 


His 


Ala 


Ser 


Gly 


Glu 


Gly 


Pro 


Leu 


Gin 


Ala 


He 


Met 


Leu 


Ala 


Arg 


Ala 


A1a 


Gin 


Arg 


Ala 


Ala 


Gin 


Ala 


Ala 


Arg 


Asp 


Arg 


Phe 


Asp 


Pro 


Glu 


Thr 


Ala 


Ala 


Ala 
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SEQ ID NO: 19: 




Het G1u Ser 




Val Cys Leu 




Ala Thr His 


5 


Arg Ser Gly 




His Gly Val 




Val Val Gly 




Ala Gin Gly 




Ser Met Lys 


10 


Tyr Lys Arg 




Lys Val Leu 




Leu Leu Gly 




His His He 




He Ala Gly 


15 


Lys Thr Met 




Arg Tyr Val 




Leu Tyr Arg 




Ala Arg Ser 




Val Val Asp 


20 


Tyr Phe Gly 




He Val Ser 




Leu Val Ala 




Gin Asp Glu 




Glu Arg Thr 


25 


Ala Lys Het 




Ser Asp Ser 




Gin Gin He 




Asn Val Ser 




Gly He Lys 


30 


Ser Ser Leu 




Asn Asn Ala 




Tyr Ala Gin 




Arg Ala Leu 




T>ir Leu Glu 


35 


Leu Ser Ala 




Val Leu Gly 




Val Leu Arg 




Arg Pro Val 




Gly Gin Leu 


40 


Glu Glu Cys 




Ala Tyr Glu 




Ser He Ser 




Glu Asn Thr 




Arg Ser Ser 


45 


Ser Tyr Lys 




Leu Pro Pro 




Ala Ala Gly 




Val Ala Ser 




Gly Ala Phe 


SO 


Tyr Leu He 




Asn Leu Phe 




Gly Asn Thr 




Ser Val Tyr 




Ala Ser Thr 


55 


Leu Leu Ala 




Gly Thr Asp 




Lys Pro Asn 




His Leu Lys 



Arg He Trp Cys Leu Val Val 
Gly Ala Ala Val Ser Ser Ser 
Ser His His Ser Ser His Thr 
Ser Val Ser Gin Arg Val Thr 
Asn Glu Thr He Tyr Asn Thr 
Val Asn Thr Thr Lys Tyr Pro 
Thr Asp Leu He Arg Phe Glu 
Pro He Asn Glu Asp Leu Asp 
Asn He Val Ala His Thr Phe 
Thr Phe Arg Arg Ser Tyr Ala 
Ser Asn Thr Glu Tyr Val Ala 
Asn Ser His Ser Gin Cys Tyr 
Thr Val Phe Val Ala Tyr His 
Gin Leu Met Pro Asp Asp Tyr 
Thr Val Lys Asp Gin Trp His 
Glu Thr Cys Asn Leu Asn Cys 
Lys Tyr Pro Tyr His Phe Phe 
He Ser Pro Phe Tyr Asn Gly 
Glu Asn Ala Asp Lys Phe Phe 
Asp Phe Gly Arg Pro Asn Ser 
Phe Leu Glu Arg Ala Asp Ser 
Lys Asn Val Thr Cys Gin Leu 
He Arg Ser Glu Ala Glu Asp 
Thr Ala Thr Phe Leu Ser Lys 
Ala Leu Asp Cys Val Arg Asp 
Phe Asn Thr Ser Tyr Asn Gin 
Val Phe Glu Thr Thr Gly Gly 
Gin Lys Ser Leu Val Glu Leu 
Asn Leu Thr His Asn Arg Thr 
Thr His Leu Ser Asn Met Glu 
Leu Gin Phe Thr Tyr Asp Thr 
Ala Gin He Ala Glu Ala Trp 
Val Phe Lys Glu Leu Ser Lys 
He Tyr Asn Lys Pro He Ala 
Leu Ala Ser Cys Val Thr He 
Asp Met Asn Val Lys Glu Ser 
Val He Phe Asn Phe Ala Asn 
Gly Glu Asp Asn Glu He Leu 
Gin Leu Pro Ser Leu Lys He 
Tyr Val Asp Tyr Leu Phe Lys 
Thr Val Asp Ser Met He Ala 
Asp Phe Arg Val Leu Glu Leu 
Asn Val Phe Asp Leu Glu Glu 
Gin Arg Val Lys Tyr Val Glu 
Tyr Leu Lys Gly Leu Asp Asp 
Lys Ala Val Gly Val Ala He 
Val Val Glu Gly Val Ala Thr 
Thr He He Leu Val Ala He 
Tyr Thr Arg Gin Arg Arg Leu 
Pro Tyr Leu Val Ser Ala Asp 
Lys Asp Thr Ser Leu Gin Ala 
Asn Ser Gly Arg Lys Gly Pro 
Ala Ala Pro Pro Tyr Thr Asn 
Leu Val Arg Leu Asp Ala Glu 
Ser Leu Asp Gly Gin Thr Gly 
Leu Leu Asp Arg Leu Arg His 
Asp Ser Asp Glu Glu Glu Asn 



Cys Val Asn Leu Cys He 
Ser Thr Arg Gly Thr Ser 
Thr Ser Ala Ala His Ser 
Ser Ser Gin Thr Val Ser 
Thr Leu Lys Tyr Gly Asp 
Tyr Arg Val Cys Ser Het 
Arg Asn He Val Cys Thr 
Glu Gly He Het Val Val 
Lys Val Arg Val Tyr Gin 
Tyr He His Thr Thr Tyr 
Pro Pro Met Trp Glu He 
Ser Ser Tyr Ser Arg Val 
Arg Asp Ser Tyr Glu Asn 
Ser Asn Thr His Ser Thr 
Ser Arg Gly Ser Thr Trp 
Met Val Thr He Thr Thr 
Ala Thr Ser Thr Gly Asp 
Thr Asn Arg Asn Ala Ser 
He Phe Pro Asn Tyr Thr 
Ala Leu Glu Thr His Arg 
Val He Ser Trp Asp He 
Thr Phe Trp Glu Ala Ser 
Ser Tyr His Phe Ser Ser 
Lys Gin Glu Val Asn Het 
Glu Ala He Asn Lys Leu 
Thr Tyr Glu Lys Tyr Gly 
Leu Val Val Phe Trp Gin 
Glu Arg Leu Ala Asn Arg 
Lys Arg Ser Thr Asp Gly 
Ser Val His Asn Leu Val 
Leu Arg Gly Tyr He Asn 
Cys Val Asp Gin Arg Arg 
He Asn Pro Ser Ala He 
Ala Arg Phe Het Gly Asp 
Asn Gin Thr Ser Val Lys 
Pro Gly krg Cys Tyr Ser 
Ser Ser Tyr Val Gin Tyr 
Leu Gly Asn His Arg Thr 
Phe He Ala Gly Asn Ser 
Arg Met He Asp Leu Ser 
Leu Asp He Asp Pro Leu 
Tyr Ser Gin Lys Glu Leu 
He Met Arg Glu Phe Asn 
Asp Lys Val Val Asp Pro 
Leu Met Ser Gly Leu Gly 
G1y A1a Val G1y Gly Ala 
Phe Leu Lys Asn Pro Phe 
Ala Val Val He He He 
Cys Met Gin Pro Leu Gin 
Gly Thr Thr Val Thr Ser 
Pro Pro Ser Tyr Glu Glu 
Gly Pro Pro Ser Ser Asp 
Glu Gin Ala Tyr Gin Met 
Gin Arg Ala Gin Gin Asn 
Thr Gin Asp Lys Gly Gin 
Arg Lys Asn Gly Tyr Arg 
Val 



60 SEQ ID N0:20: 

Met Ser Lys t^t Val Val Leu Phe Leu Ala Val Phe Leu Met Asn Ser 
Val Leu Met He Tyr Cys Asp Pro Asp His Tyr He Arg Ala Gly Tyr 
Asn His Lys Tyr Pro Phe Arg He Cys Ser He Ala Lys Gly Thr Asp 
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Leu Het 
Ala Lys 
Thr Tyr 
Ser Sep 
Met Gly 
Ala Gin 
Phe Ser 
Phe Pro 
Lys G1u 
Ttir Ser 
Pro Phe 
. Pro Phe 
Lys Leu 
Gly Met 
Lys Ala 
Val Cys 
Glu Thr 
Phe Val 
Cys He 
Asp Tyr 
Thr Thr 
Leu Met 
Val Val 
Tyr Leu 
Leu Ala 
His Glu 
Gly Arg 
Lys Cys 
Arg Val 
Arg Pro 
Pro Gly 
Thr Glu 
His Ala 
Glu Asp 
Leu Glu 
Leu Ser 
Asn Ser 
Asn Thr 
Gly Ala 
Ala Leu 
Phe Gly 
He He 
Asp Het 
Val Thr 
Gly Gly 
Val Ser 
Lys Met 
Pro Ser 
He Arg 

SEQ ID N0:21: 
Met Phe 
Ser Leu 
His Leu 
Asp Ala 
Ser Thr 
Leu Gly 
He Ala 
Val Ser 
Tyr Ala 
Asp Lys 
His Lys 
Leu He 



Arg Phe 
Met Sep 
Thr Phe 
Tyr Arg 
Leu Ala 
Cys Tyr 
Ala Phe 
Leu Asn 
Pro Tyr 
Leu Asn 
Ser Tyr 
Phe Asn 
Thr He 
Asn Gly 
Asp Thr 
Het Leu 
Asp G1u 
Ala Pro 
Lys Asp 
Asn Asp 
Gly Asp 
Phe Leu 
Asp Val 
Tyr Asp 
Glu Ser 
Leu Ser 
Pro He 
He Glu 
Val Asp 
Leu Val 
Gin Leu 
Glu Cys 
His Val 
He Glu 
Asn Ala 
Arg Ala 
Tyr Lys 
Pro Ser 
He Gly 
Gly Asp 
Gly Gly 
Val Val 
Met Phe 
Gly Thr 
Thr Ser 
Gly Gin 
Leu Lys 
Ser Ser 
Tyr Arg 



Asp Arg 
Glu Gly 
Pro Val 
Asp Val 
Met Pro 
Ser Ala 
His Glu 
Phe Lys 
Phe Ala 
Cys He 
Phe Ala 
Gly Ser 
Leu Glu 
Ala Thr 
Leu Phe 
Lys His 
Thr Tyr 
Lys Glu 
Glu Pte 
Thr Tyr 
Leu He 
Glu Gin 
Lys Ser 
Thr Leu 
Trp Cys 
Lys He 
Ser Ala 
Val Asn 
Ala Lys 
Thr Phe 
Gly Leu 
Glu He 
Tyr Thr 
Val Leu 
Asp Phe 
Asn Val 
Ser Ala 
Tyr Val 
Thr Gly 
He Val 
Leu Het 
Phe Val 
Pro Tyr 
Thr Val 
Val Ala 
Val Ser 
A1a He 
Glu Ser 
Gly Tyr 



Asp He Ser 
Phe Phe He 
Arg Thr Tyr 
Gly Val Val 
Val Tyr Glu 
Val Ala Met 
Asp Asn Asn 
Ser He Thr 
Arg Gly Pro 
Val Thr Glu 
Leu Thr Thr 
Asn Gly Lys 
Asn Tyr Thr 
Thr Leu Val 
Ser Trp Glu 
Trp Thr Thr 
His Phe He 
Ser Leu Asn 
Glu Lys He 
Ser Het Asn 
Leu He Trp 
Gly Ser Glu 
Arg His Asp 
Lys Asp Tyr 
Leu Asp Gin 
Ser Pro Ser 
Gin Leu His 
Gin Ser Ser 
Gly Val Arg 
Ser Phe Val 
Asp Asn Glu 
Pro Ser Thr 
Asp Tyr Thr 
Asp Ala Phe 
Lys Val Leu 
Phe Asp Leu 
Leu Tyr Thr 
Asn Gly He 
Leu Gly Ser 
Gly Gly Val 
Leu He Leu 
Arg Gin Arg 
Ala Thr Asn 
Val Lys Thr 
Val Ser Glu 
Asp Asp Glu 
Lys Ser Leu 
His Ala Ser 
Lys Ser Val 



Cys Ser 
He Tyr 
Lys Lys 
Tyr Phe 
Ala Asn 
Lys Arg 
Lys Asn 
Asn Lys 
Leu Trp 
Ala Thr 
Gly Glu 
His Phe 
Het He 
Arg Lys 
He Lys 
Val Thr 
Ser Lys 
Leu Thr 
He Asn 
Gly Ser 
Gin Pro 
L^s He 
He Leu 
He Asn 
Lys Arg 
Ser He 
Gly Asp 
Val Gin 
Ser Glu 
Asn Ser 
He Leu 
Lys He 
His Thr 
He Arg 
Asp Leu 
Glu Asn 
He Glu 
Asn Ser 
Val He 
Val Ser 
Ala He 
His Val 
Pro Val 
Pro Ser 
Lys Glu 
Tyr Ser 
Asp Glu 
Lys Pro 
Asn Val 



Pro Tyr 
Lys Thr 
Glu Leu 
Leu Asp 
Leu Val 
Pro Asp 
Asn Thr 
Arg Phe 
Leu Tyr 
Ala Lys 
He Val 
Ala Glu 
Glu Asp 
He Ala 
Glu Glu 
His Gly 
Glu Leu 
Asp Pro 
Glu Val 
Tyr Gin 
Leu Val 
Arg Arg 
Tyr Val 
Asp Ala 
Thr He 
Val Ser 
Val Leu 
Leu His 
Thr Met 
Thr Pro 
Leu Gly 
Phe Leu 
Asn Ser 
Leu Lys 
Tyr Ser 
He Leu 
Ala Lys 
Phe Leu 
Ser Val 
Phe Leu 
Val Val 
Leu Sep 
Thr Thr 
Val Lys 
Glu Gly 
Gin Glu 
Ser Tyr 
Ser Leu 
Glu Glu 



Lys Ser Asn 
Asn He Glu 
Ihr Phe Gin 
Arg Thr Val 
Asn Ser His 
Gly T>ir Val 
Leu Asn Leu 
He Thr Thr 
Ser Thr Ser 
Ala Lys Tyr 
Glu Gly Ser 
Pro Leu Glu 
Leu Het Asn 
Phe Leu Glu 
Asn Glu Ser 
Leu Arg A1a 
Thr Ala Ala 
Lys Gin Thr 
Tyr Met Ser 
He Phe Lys 
Gin Lys Ser 
Arg Arg Asp 
Gin Leu Gin 
Leu Gly Asn 
Thr Het Leu 
Glu Val Tyr 
Ala He Ser 
Lys Ser Het 
Cys Tyr Asn 
Glu Val Val 
Asp His Arg 
Ser Gly Asn 
Thr Pro He 
He Asp Pro 
Pro Asp Glu 
Arg Glu Tyr 
He Ala Thr 
Gin Gly Leu 
Thr Ala Gly 
Lys Asn Pro 
Val Val He 
Lys Pro He 
Val Ser Ser 
Asp Val Asp 
Met Ala Asp 
Ala Ala Leu 
Arg Arg Lys 
He Asp Arg 
Ala 



Val Thr 
Gin Val 
Gly Asp 
Glu Thr 
He Val 
Lys Asn 
Ala Tyr 
Thr Ala 
Asp Arg 
Phe Gly 
Val Glu 
Ala Ser 



Ala Val 
Glu Pro 
Gly Asp 
Lys Pro 
Arg Leu 
Phe Thr 
Lys Phe 
Trp Ala 
Val Pro 
Lys Cys 
Ala Phe 
Lys Tyr 



Val Ser 
Thr Gin 
Glu He 
Thr Phe 
Glu Pro 
Glu Gly 
Lys Ala 
Gly Ser 
He Pro 
Ser Ser 
Asn Glu 
Asn Ser 



Val Ser Pro 
Ser Glu Asp 
Arg Glu Ala 
Tyr Val Cys 
Thr Arg Thr 
He Ala Val 
Thr Val Tyr 
Ser Tyr Thr 
Val Ser G1u 
Lys Ala Thr 
Asp Lys Asn 
Val Gly Ser 



Ser Ser Phe 
He Thr Arg 
He His Lys 
Pro Pre Pro 
Cys Pro Asp 
Val Tyr Lys 
Tyr Lys Asp 
Gin He Thr 
He Thr Asp 
Tyr Val Arg 
Pro Gin Asp 
Lys Ala Trp 



Tyr Glu 
Ser Ala 
Ser Gin 
Thr Gly 
Tyr His 
Glu Asn 
Val He 
Asn Arg 
Thr He 
Asn Asn 
Het Pro 
His Thr 
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Thr 


Asn 


Asp 


Thr 


Tyr 


Met 




Gly 


Thr 


Ser 


Val 


Asn 


Cys 




Phe 


Pro 


Tyr 


Asp 


Ser 


Phe 




Ser 


Pro 


Phe 


Phe 


Gly 


Leu 


5 


Tyr 


Ala 


Met 


Asp 


Arg 


Phe 




Leu 


Asp 


Thr 


Arg 


Ala 


Leu 




Thr 


Pro 


His 


Leu 


Thr 


Val 




Val 


Cys 


Ser 


Leu 


Val 


Lys 




Glu 


Tyr 


Ala 


His 


Asn 


Phe 


10 


Phe 


He 


Ser 


Glu 


Thr 


Asn 




G)n 


Cys 


Val 


Lys 


Glu 


Glu 




Thr 


Arg 


Tyr 


Asn 


Ser 


Ser 




Leu 


Ala 


Arg 


Gly 


Gly 


Phe 




Ser 


Leu 


Ala 


Arg 


Leu 


Tyr 


15 


His 


Ser 


Pro 


Gin 


Lys 


His 




Val 


Pro 


Va1 


Glu 


Leu 


Arg 




Val 


Glu 


Phe 


Ala 


Met 


Leu 




Val 


Asn 


Glu 


Met 


Leu 


Ala 




Asn 


Arg 


G1u 


Arg 


A1a 


Leu 


20 


Ala 


Leu 


Ala 


Ser 


Thr 


He 




Gly 


Asp 


Va1 


lie 


Ser 


Val 




Arg 


He 


lie 


Leu 


Gin 


Asn 




Cys 


Tyr 


Ser 


Arg 


Pro 


Leu 




Thr 


Va1 


Glu 


Gly 


Gin 


Leu 


25 


Asp 


Leu 


Leu 


Glu 


Pro 


Cys 




Giv 


His 


His 


Tyr 


Val 


Tyr 




Ala 


Val 


His 


Asp 


Val 


Gly 




Thr 


Leu 


Leu 


Lvs 


Asp 


Arg 




Aso 


Glu 


Leu 


Arg 


Asp 


Thr 


30 


Arc 


Asn 


Gin 


Met 


His 


Ser 




Gin 


Tvr 


ASD 


Ser 


Gly 


Thr 




Gin 


Giv 


Leu 


Gly 


Thr 


A1a 




Ala 


Thr 


Giv 


Ala 


Leu 


Leu 




Ser 


Asn 


Pro 


Phe 


Gly 


A1a 


35 


Leu 


Val 


Ala 


Ala 


Phe 


Phe 




Ser 


Pro 


Met 


Lys 


Ala 


Leu 




Leu 


Pro 


G1u 


Gly 


Met 


Asp 




Thr 


Pro 


He 


Glu 


Glu 


He 




Asn 


Ser 


Gly 


Phe 


Asp 


Pro 


40 


Lys 


Tyr 


Met 


Thr 


Leu 


Val 




Arg 


Lys 


Lys 


Asn 


Lys 


Thr 




Leu 


Ala 


Leu 


Aro 


Asn 


Ara 




Val 


Thr 


Gly 


Val 






45 


SEQ ID NO: 22: 












Met 


Arg 


Gin 


Gly 


Ala 


A1a 




Leu 


Leu 


Gly 


Leu 


Thr 


Leu 




Ser 


Pro 


Giv 


Thr 


Pro 


Giv 




Gly 


Pro 


Ala 


Thr 


Pro 


Ala 


50 


Asp 


Thr 


Lys 


Pro 


Lys 


Lys 




Arg 


Pro 


Ala 


Giv 


Aso 


Asn 




Arg 


Glu 


His 


Leu 


Am 


Asp 




Tyr 


Val 


Cys 


Pro 


Pro 


Pro 




Pro 


Arg 


Arg 


Cys 

W J w 


Pro 


Thr 


55 


He 


Ala 


Val 


Val 


Phe 


Lys 




Thr 


Met 


Tyr 


Tyr 


Lys 


Asp 




Arg 


Tyr 


Ser 


Gin 


Phe 


Met 




Phe 


Glu 


Glu 


Va1 


He 


Asp 




Thr 


Ala 


Lys 


Tyr 


Val 


Arg 


60 


Asp 


Asp 


His 


Glu 


Thr 


Asp 




Arg 


Thr 


Ser 


Arg 


Gly 


Trp 




Arg 


Val 


G1u 


Ala 


Phe 


His 




Glu 


Glu 


Val 


Asp 


Ala 


Arg 



Val Ala Gly Thr Pro Gly Thr Tyr Arg Thr 
He He Glu Glu Val Glu Ala Arg Ser He 
Gly Leu Ser Thr Gly Asp He He Tyr Met 
Arg Asp Gly Ala Tyr Arg Glu His Ser Asn 
His Gin Phe Glu Gly Tyr Arg Gin Arg Asp 
Leu Glu Pro Ala Ala Arg Asn Phe Leu Val 
Gly Trp Asn Trp Lys Pro Lys Arg Thr G1u 
Trp Arg Glu Val Glu Asp Val Val Arg Asp 
Arg Phe Thr Met Lys Thr Leu Ser Thr Thr 
Glu Phe Asn Leu Asn Gin He His Leu Ser 
Ala Arg Ala He He Asn Arg He Tyr Thr 
His Val Arg Thr Gly Asp He Gin Thr Tyr 
Val Val Val Phe Gin Pro Leu Leu Ser Asn 
Leu Gin Glu Leu Val Arg Glu Asn Thr Asn 
Pro Thr Arg Asn Thr Arg Ser Arg Arg Ser 
Ala Asn Arg Thr He Thr Thr Thr Ser Ser 
Gin Phe Thr Tyr Asp His He Gin Glu His 
Arg He Ser Ser Ser Trp Cys Gin Leu Gin 
Trp Ser Gly Leu Phe Pro He Asn Pro Ser 
Leu Asp Gin Arg Val Lys Ala Arg He Leu 
Ser Asn Cys Pro Glu Leu Gly Ser Asp Thr 
Ser Met Arg Val Ser Gly Ser Thr Thr Arg 
He Ser He Val Ser Leu Asn Gly Ser Gly 
Gly Thr Asp Asn Glu Leu He Met Ser Arg 
Val Ala Asn His Lys Arg Tyr Phe Leu Phe 
Tyr Glu Asp Tyr Arg Tyr Val Arg Glu He 
Met He Ser Thr Tyr Val Asp Leu Asn Leu 
Glu Phe Met Pro Leu Gin Val Tyr Thr Arg 
Gly Leu Leu Asp Tyr Ser Glu He Gin Arg 
Leu Arg Phe Tyr Asp He Asp Lys Val Val 
Ala He Met Gin Gly Met Ala Gin Phe Phe 
Gly Gin Ala Val Gly His Val Val Leu Gly 
Ser Thr Val His Gly Phe Thr Thr Phe Leu 
Leu Ala Val Gly Leu Leu Val Leu Ala Gly 
Ala Tyr Arg Tyr Val Leu Lys Leu Lys Thr 
Tyr Pro Leu Thr Thr Lys Gly Leu Lys Gin 
Pro Phe Ala Glu Lys Pro Asn Ala Thr Asp 
Gly Asp Ser Gin Asn Thr Glu Pro Ser Val 
Asp Lys Phe Arg Glu Ala Gin Glu Met He 
Ser Ala Ala G1u Arg Gin Glu Ser Lys Ala 
Ser Ala Leu Leu Thr Ser Arg Leu Thr Gly 
Arg Gly Tyr Ser Arg Val Arg Thr Glu Asn 



Arg Gly Cys Arg Trp Phe Val Val Trp Ala 
Gly Val Leu Val Ala Ser Ala Ala Pro Ser 
Val Ala Ala Ala Thr Gin Ala Ala Asn Gly 
Pro Pro Ala Pro Gly Pro Ala Pro Thr Gly 
Asn Lys Lys Pro Lys Asn Pro Pro Pro Pro 
Ala Thr Val Ala Ala Gly His Ala Thr Leu 
He Lys Ala Glu Asn Thr Asp Ala Asn Phe 
Thr Gly Ala Thr Val Val Gin Phe Glu Gin 
Arg Pro Glu Gly Gin Asn Tyr Thr Glu Gly 
Glu Asn He Ala Pro Tyr Lys Phe Lys Ala 
Val Thr Val Ser Gin Val Trp Phe Gly His 
Gly He Phe Glu Asp Arg Ala Pro Val Pro 
Lys He Asn Ala Lys Gly Val Cys Arg Ser 
Asn Asn Leu Glu Thr Thr Ala Phe His Arg 
Met Glu Leu Lys Pro A1a Asn Ala Ala Thr 
His Thr Thr Asp Leu Lys Tyr Asn Pro Ser 
Arg Tyr Gly Thr Thr Val Asn Cys He Val 
Ser Val Tyr Pro Tyr Asp Glu P^e Val Leu 
* 0^ 
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Ala Thr 
Gly Ser 
Val Asp 
Ala Pro 
Trp Asp 
Gin G1u 
Phe Ser 
Tyr Pro 
Arg Asp 
He Lys 
He Ala 
Arg Glu 
Pro Pro 
Thr Ser 
Gin Arg 
Glu Leu 
Asn Pro 
Arg Met 
Ala Asp 
Gly Ala 
Gly Pro 
Thr Arg 
Thr Phe 
Gin Leu 
Asn He 
Thr Arg 
Gin Arg 
Val He 
Phe Phe 
Het Gly 
Phe Het 
Ala Gly 
Gin Ser 
Lys Asn 
Asp Phe 
Het Ala 
Lys Gly 
Arg Lys 
Asp Ala 



Gly Asp 
His Thr 
Gly Phe 
Thr Thr 
Trp Val 
Val Asp 
Ser Asp 
Leu Ser 
Ala Het 
Val Gly 
Tyr Gin 
His Leu 
Pro Gly 
Ser He 
His Val 
Gin Asn 
Asn Ala 
Leu Gly 
Asn Val 
Cys Tyr 
Leu Val 
Asp Ala 
Gly Gly 
Ser Arg 
Thr Met 
His Glu 
Arg Asn 
His Ala 
Glu G1y 
He Val 
Ser Asn 
Leu Ala 
Asn Pro 
Pro Thr 
Asp Glu 
Leu Val 
Thr Ser 
Arg Arg 
Asp Glu 



Phe Val 
Glu His 
Tyr Ala 
Arg Asn 
Pro Lys 
Glu Met 
Ala He 
Arg Val 
Asp Arg 
Gin Pro 
Pro Leu 
Arg Glu 
Ala Ser 
Glu Phe 
Asn Asp 
His Glu 
He Ala 
Asp Val 
He Val 
Ser Arg 
Glu Gly 
He Glu 
Gly Tyr 
A1a Asp 
Leu Glu 
He Lys 
Gin Leu 
Asp Ala 
Het G1y 
Gly Gly 
Pro Phe 
Ala Ala 
Met Lys 
Asn Pro 
Ala Lys 
Ser Ala 
Ala Leu 
Asn Thr 
Asp Asp 



Tyr Met 
Thr Ser 
Arg Asp 
Leu Leu 
Arg Pro 
Leu Arg 
Ser Thr 
Asp Leu 
He Phe 
Gin Tyr 
Leu Ser 
Gin Ser 
Ala Asn 
Ala Arg 
Met Leu 
Leu Thr 
Ser Ala 
Met A1a 
Gin Asn 
Pro Leu 
Gin Val 
Pro Cys 
Val Tyr 
He Thr 
Asp His 
Asp Ser 
His Asp 
Asn Ala 
Asp Leu 
Val Val 
Gly Ala 
Phe Phe 
Ala Leu 
Asp Ala 
Leu Ala 
Het Glu 
Leu Ser 
Asn Tyr 
Leu 



Ser Pro 
Tyr Ala 
Leu Thr 
Thr Thr 
Ser Val 
Ser Glu 
Thr Phe 
Gly Asp 
Ala Arg 
Tyr Leu 
Asn Thr 
Arg Lys 
A1a Ser 
Leu Gin 
Gly Arg 
Leu Trp 
Thr Val 
Va1 Ser 
Ser Met 
Val Ser 
Gly Glu 
Thr Val 
Phe Glu 
Thr Val 
Glu Phe 
Gly Leu 
Leu Arg 
Ala Met 
Gly Arg 
Ser Ala 
Leu A1a 
Ala Phe 
Tyr Pro 
Ser Gly 
Glu Ala 
Arg Thr 
Ala Lys 
Thr Gin 



Phe Tyr 
Ala Asp 
Thr Lys 
Pro Lys 
Cys Thr 
Tyr Gly 
Thr Thr 
Cys He 
Arg Tyr 
Ala Asn 
Leu Ala 
Pro Pro 
Va1 Glu 
Phe Thr 
Val Ala 
Asn Glu 
Gly Arg 
Thr Cys 
Arg He 
Phe Arg 
Asn Asn 
Gly His 
Glu Tyr 
Ser Thr 
Val Pro 
Leu Asp 
Phe Ala 
Phe Ala 
Ala Val 
Val Ser 
Val Gly 
Arg Tyr 
Leu Thr 
Glu Gly 
Arg Glu 
Glu His 
Val Thr 
Val Pro 



Gly Tyr 
Arg Phe 
Ala Arg 
Phe Thr 
Het Thr 
Gly Ser 
Asn Leu 
Gly Lys 
Asn Ala 
Gly Gly 
Glu Leu 
Asn Pro 
Arg He 
Tyr Asn 
He Ala 
Ala Arg 
Arg Val 
Val Pro 
Ser Ser 
Tyr Glu 
Glu Leu 
Arg Arg 
Ala Tyr 
Phe He 
Leu Glu 
Tyr Thr 
Asp He 
Gly Leu 
Gly Lys 
Gly Val 
Leu Leu 
Val Het 
Thr Lys 
Glu Glu 
Het He 
Lys Ala 
Asp Met 
Asn Lys 



SEQ ID N0:23: 
Met Arg 
Leu Ala 
Ala Ala 
Asp G1u 
Pro Ala 
Asp Leu 
Cys Pro 
Pro Cys 
Val Val 
Tyr Tyr 
Ser Gin 
Glu Val 
Lys Tyr 
His Glu 
Ser Arg 
Glu Ala 
Val Glu 
Gly Asp 
His Gly 
Gly Tyr 
Val Thr 
Trp Ala 



Pro Arg 
Leu Ala 
Ala Pro 
Glu Val 
Ala Asn 
Arg Glu 
Pro Pro 
Pro Arg 
Phe Lys 
Lys Asp 
Phe Met 
Het Asp 
Val Arg 
Ser Asp 
Gly Trp 
Phe His 
Ala Arg 
Phe Val 
Glu His 
Tyr Glu 
At? Asn 
Pro Lys 



Gly Thr 
Val He 
Thr Ala 
Pro Asp 
Ala Ser 
He Lys 
Thr Gly 
Ala Pro 
Glu Asn 
Val Thr 
Gly He 
Lys He 
Asn Asn 
Met Ala 
His Thr 
Arg Tyr 
Ser val 
Tyr Met 
Thr Ala 
Arg Asp 
Leu Leu 
Arg Pro 



Pro Pro 
Ala Ala 
Asp Pro 
Glu Asp 
Val Glu 
Ala Arg 
Ala Thr 
Asp Gly 
He Ala 
Val Ser 
Phe Glu 
Asn Ala 
Met Glu 
Leu Lys 
Thr Asp 
Gly Thr 
Tyr Pro 
Ser Pro 
Tyr Ala 
Leu Ser 
Thr Thr 
Ser Val 



Ser Phe 
Ala Gly 
Ala Ala 
Gly Glu 
Ala Gly 
Asp Gly 
Val Val 
Gin Asn 
Pro Tyr 
Gin Val 
Asp Arg 
Lys Gly 
Ser Thr 
Pro Ala 
Leu Lys 
Thr Val 
Tyr Asp 
Phe Tyr 
Ala Asp 
Thr Gly 
Pro Lys 
Cys Thr 



Leu Pro 
Arg Ala 
Thr Pro 
Gly Val 
Arg Ala 
Asp Ala 
Gin Phe 
Tyr Thr 
Lys Phe 
Trp Phe 
Ala Pro 
Val Cys 
Ala Phe 
Lys Ala 
Tyr Asn 
Asn Cys 
Glu Phe 
Gly Tyr 
Arg Phe 
Arg Arg 
Phe Thr 
Leu Thr 

>03 



Leu Pro 
Ala Pro 
Ala Leu 
Ala Thr 
Thr Leu 
Thr Phe 
Glu Gin 
Glu Gly 
Lys Ala 
Gly His 
Val Pro 
Arg Ser 
His Arg 
Ala Thr 
Pro Ala 
He Val 
Val Leu 
Arg Asp 
Arg Gin 
Ala Ala 
Val Gly 
Lys Trp 



Arg Glu 
Lys Gin 
Ala Thr 
Val Ala 
Lys Trp 
Phe Arg 
Thr Glu 
Asp Ala 
Thr His 
Phe Leu 
Tyr Val 
Thr Pro 
Lys Thr 
His He 
Trp Cys 
Lys Leu 
Ser Ala 
Val Ala 
Arg Pro 
Asp Gin 
Arg Leu 
Tyr Phe 
Ser His 
Asp Leu 
Val Tyr 
Glu Val 
Asp Thr 
Gly Ala 
Val Val 
Ser Ser 
Val Leu 
Arg Leu 
Glu Leu 
Gly Gly 
Arg Tyr 
Lys Lys 
Val Met 
Asp Gly 



Val Leu 
Ala Ala 
Pro Glu 
Pro Ala 
Arg Glu 
Tyr Val 
Pro Arg 
He Ala 
Thr Met 
Arg Tyr 
Phe Glu 
Thr Ala 
Asp Asp 
Arg Thr 
Arg Val 
Glu Glu 
Ala Thr 
Gly Ser 
Val Asp 
Ala Pro 
Trp Asp 
Arg Glu 
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Val 


Asp 


G1u 


Het 


Leu 


Arg 


Ala 


Glu 


Tyr Gly Pro Ser Phe Arg Phe Ser 


Ser 


Ala 


Ala 


Leu 


Ser 


Thr 


Thr 


Phe 


Thr Ala Asn Arg Thr Glu Tyr Ala 


Leu 


Ser 


Arg 


Val 


Asp 


Leu 


Ala 


Asp 


Cys Val Gly Arg Glu Ala Arg Glu 


Ala 


Val 


Asp 


Arg 


He 


Phe 


Leu 


Arg 


Arg Tyr Asn Gly Thr His Val Lys 


Val 


Gly 


Gin 


Val 


Gin 


Tyr 


Tyr 


Leu 


Ala Thr Gly Gly Phe Leu He Ala 


Tyr 


G1n 


Pro 


Leu 


Leu 


Ser 


Asn 


Ala 


Leu Val Glu Leu Tyr Val Arg Glu 


Leu 


Val 


Arg 


Glu 


Gin 


Thr 


Arg 


Arg 


Pro Ala Gly Gly Asp Pro Gly Glu 


A1a 


Ala 


Thr 


Pro 


Gly 


Pro 


Ser 


Val 


Asp Pro Pro Ser Val Glu Arg He 


Lys 


Ttir 


Thr 


Ser 


Ser 


Val 


Glu 


Phe 


Ala Arg Leu Gin Phe Thr Tyr Asp 


His 


He 


Gin 


Arg 


His 


Val 


Asn 


Asp 


Het Leu Gly Arg He Ala Thr Ala 


Trp 


Cys 


Glu 


Leu 


Gin 


Asn 


Arg 


Glu 


Leu Thr Leu Trp Asn Glu Ala Arg 


Arg 


Leu 


Asn 


Pro 


Gly 


Ala 


lie 


Ala 


Ser Ala Thr Val Gly Arg Arg Val 


Ser 


Ala 


Arg 


Het 


Leu 


Gly 


Asp 


Val 


Met Ala Val Ser Thr Cys Val Pro 


Va1 


Ala 


Pro 


Asp 


Asn 


Val 


He 


Met 


Gin Asn Ser He Gly Val Ala Ala 


Arg 


Pro 


Gly 


Thr 


Cys 


Tyr 


Ser 


Arg 


Pro Leu Val Ser Phe Arg Tyr Glu 


Ala 


Asp 


Gly 


Pro 


Leu 


Val 


Glu 


Gly 


Gin Leu Gly Glu Asp Asn Glu He 


Arg 


Leu 


Glu 


Arg 


Asp 


Ala 


Leu 


Glu 


Pro Cys Thr Val Gly His Arg Arg 


Tyr 


Phe 


Thr 


Phe 


Gly 


Ala 


Gly 


Tyr 


Val Tyr Phe Glu Glu Tyr Ala Tyr 


Ser 


His 


G1n 


Leu 


Gly 


Arg 


Ala 


Asp 


Val Thr Thr Val Ser Thr Phe He 


Asn 


Leu 


Asn 


Leu 


Thr 


Met 


Leu 


Glu 


Asp His Glu Phe Val Pro Leu Glu 


Val 


Tyr 


Thr 


Arg 


Gin 


Glu 


lie 


Lys 


Asp Ser Gly Leu Leu Asp Tyr Thr 


Glu 


Val 


Gin 


Arg 


Arg 


Asn 


Gin 


Leu 


His Ala Leu Arg Phe Ala Asp He 


Asp 


Thr 


Val 


He 


Lys 


Ala 


Asp 


Ala 


His Ala Ala Leu Phe Ala Gly Leu 


Tyr 


Ser 


Phe 


Phe 


Glu 


Gly 


Leu 


Gly 


Asp val Gly Arg Ala Val Gly Lys 


Val 


Val 


Met 


Gly 


He 


Val 


Gly 


Gly 


Val Val Ser Ala Val Ser Gly Val 


Ser 


Ser 


Phe 


Leu 


Ser 


Asn 


Pro 


Phe 


Gly Ala Leu Ala Val Gly Leu Leu 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Ala 


Ala 


Phe Phe Ala Phe Arg Tyr Val Met 


Arg 


Leu 


Gin 


Arg 


Asn 


rro 


net 


1 we 

Lys 


Ala Lni 1 yr rro lcu inr inr Lys 


Glu 


Leu 


Lys 


Ser 


Asp 


Gly 


Ala 


Pro 


Leu Ala Gly Gly Gly Glu Asp Gly 


Ala 


Glu 


Asp 


Phe 


Asp 


Glu 


Ala 


Lys 


Leu Ala Gin Ala Arg Glu Met He 


Arg 


Tyr 


Met 


Ala 


Leu 


Val 


Ser 


Ala 


Het Glu Arg Thr Glu His Lys Ala 


Arg 


Lys 


Lys 


Gly 


Thr 


Ser 


Ala 


Leu 


Leu Ser Ala Lys Val Thr Asp Ala 


Val 


Het 


Arg 


Lys 


Arg 


Ala 


Arg 


Pro 


Arg Tyr Ser Pro Leu Arg Asp Thr 


Asp 


Glu 


Glu 


Glu 


Leu 










SEQ ID HO: 24 

















10 



15 



20 



25 



3D 



35 

GCTGTTCAGA TTTSACTTAG AYKM*CKTG YCC 33 
SEQ ID N0:25: 

40 G7TITACAAGA AGWCATCGT GCCNTAYATN TTYAA 3S 
SEQ ID N0:26: 

GTGTACAAGA AGAACATCGT GCC 23 

45 SEQ ID N0:27: 

AACATGTCTA CAATCTCACA RTTKACNGDJ GT 32 



50 



SEQ ID »0:26: 

AACATGTCTA CAATCTCACA 20 
SEQ ID NO: 29: 

AATAACCTCT TTACQQCCCA AATTCARTWY XKTAYGA 28 



SEQ ID M0:30: 

55 CCAACGAGTG TGATCTCAGC CATTTAYGGN AARCCNGT 38 
SEQ ID N0:31: 

CCAACGAGTG TGATGTCAGC C 21 

60 SEQ ID N0:32: 

TGaACTCGC GACCTCTAGT CACCTTYAAR TTYRTNAA 38 

SEQ ID N0:33: 
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TGCTACTCGC GACCTCTAGT CACC 24 
SEQ 10 N0:34: 

ACCGGAGTAC AGTTCCACTC TYTTRAARTC DATRTT 36 

5 

SEQ ID N0:35: 

TCTCACCTTG ACATGAiSGCC A 21 
SEQ ID N0:36: 

10 rrraACCTQG AGAaATGrr YHGNGARTAY AA 32 
SEQ 10 H0:37: 

GCTtTQGGTC TAGTAGTrKT AnCYCTTlAA CAT 33 

15 SEQ ID N0:38: 

TCTCG&VACA TQCTCTCCAG RTCRAAMACR TT 32 



20 



SEQ ID N0:39: 

ACCTTCATCA AAAATCCOT NGGNQGNATG YT 32 
SEQ ID N0:4D: 

TQSACTTACA QGACTCGAAC KACNGTNAAY TC 32 



ICS' 
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Claims 

What is claimed is: 

5 1. An isolated polynucleotide with a region encoding a Glycoprotein 6 of a herpes virus, the 

polynucleotide comprising a sequence at least 65% identical to nucleotides 36 to 354 of SEQ. ID 
NO:1 or SEQ. ID NO:3. 

2. An isolated poiynucieotide comprising a fragment of at least 50 consecutive nucleotides of the 
1 0 Glycoprotein B encoding region of the polynucleotide of claim 1 . 

3. An isolated polynucleotide with a region encoding a Glycoprotein B of a herpes virus, the 
polynucleotide comprising a sequence selected from the group consisting of. a sequence of 35 
nucleotides at least 74% identical to oligonucleotide SHMDA (SEQ. ID NOr41): a sequence of 30 

15 nucleotides at least 73% identical to oligonudotide CFSSB (SEQ. ID NO:43): a sequence of 29 

nucleotides at least 72% identical to oligonucleotide ENTTFA (SEQ. ID NO:45): and a sequence of 
35 nucleotides at least 80% identical to oligonucleotide DNIQB (SEQ. ID NO:46). 

4. An isolated polynucleotide comprising a fragment of at least 50 consecutive nucleotides of the 
20 Glycoprotein B encoding region of the polynucleotide of claim 3. 

5. The polynucleotide of claim 1 or claim 2, wherein said herpes virus is capable of infecting 
primates. 

25 6. The polynucleotide of claim 1 or claim 2. wherein said herpes virus is RFHV1 . RFHV2. or KSHV. 

7. An isolated polynucleotide comprising a linear sequence of at least 21 nucleotides identical to a 
linear sequence between nucleotides 36 to 354 irwlusive of SEQ. ID NOS:1 , 3. or 92 or anywhere 
witiiin SEQ. ID NO:96; but not in SEQ. ID NO:98. 

30 

8 Tbe isolated polynucleotide of daim 7. comprising a linear sequence essentially identical to 
nudeotides 36 to 354 of SEQ. ID N0;1. SEQ. ID N0;3. or SEQ. ID NO:96, or at least 200 
consecutive nucleotides in SEQ. ID N0.92. 

35 9. An isolated polypeptide er>coded by the polynudeotide of daim 2. 

10. An isolated polypeptide, comprising a linear sequence of at least 17 amino adds essentially 
identical to a sequence between amino adds 13 to 116 rrvdusive of SEQ. ID NO S:2, 4. or 97 or 
anywhere within SEQ. ID NO:94; but not in SEQ. ID N0.99. 
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11. The isotated polypeptide of daim 1 0. consisting essentially of a polypeptide having the sequence 
shown in SEQ. ID N0:2, SEQ. ID N0:4. SEQ. ID NO:94, or SEQ. ID NO:97 

12. A fusion polypeptide comprising a linear sequence of at least 17 amino adds identical to that 
comprised in the polypeptide according to daim 10. 

13. The isolated polypeptide of daim 10. implicated in the binding or fusion of herpes virus with a 
mammalian cell. 

14. The isolated polypeptide of daim 10. which is glycosylated. 

1 5. The isolated polypeptide of daim 1 0. which Is not glycosylated. 

16. The isolated polypeptide of daim 10. which is immunogenic. 

17. The isolated polypeptide of daim 10. comprising a sequence selected from the group consistir*g 
of SEQ. ID NOS:67.76. 

18. An isolated polynudeotide encoding the polypeptide of daim 1 0. 

19. A non-naturaliy occurring polynudeotide encoding the polypeptide of daim 1 0. 

20. A polynudeotide encoding a fusion polypeptide, comprising the polynudeotide of daim 2 joined 
directly to 8 second polynudeotide encoding a polypeptide. 

21. A recombinant cloning vector comprising a polynucleotide sequence encoding a linear sequence 
of at least 17 amino adds identical to that comprised in the polypeptide according to claim 10. 

22. A recombinant expression vector comprising a polynudeotide sequence ertcoding a linear 
sequence of at least 17 amino adds identical to that comprised in the polypeptide according to 
daim 10. operatively linked to a control polynucleotide sequence. 

23. A recombinant cloning vector comprising a linear sequence of at least 21 nudootides identical to 
a linear sequerice comprised in the polynudeotide of daim 7. 

24. A host cell transfonned by the polynudeotide of daim 1 8 or daim 1 9. or by the vector of daim 2 1 . 
daim 22, or daim 23. 
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25. A monocional or isolated polydonal antibody specific for a Glycoprotein B polypeptide encoded in 
said encoding region of the polynucleotide of daim 1 . 

26. A nrtonodonal or isolated polydonal antibody spedfic for the polypeptide of daim 11 , but not for a 
polypeptide having an amino add sequence of any of SEQ. ID NOS:30-41 .. 

27. The antibody of daim 26. which is a monodonal antibody. 

28. The antibody of daim 26. which is an isolated polydonal antibody. 

29. A vacdne comprising the polypeptide of daim 9 or daim 10 in a phanmaceutically compatible 
exdpient. 

30. The vacdne of daim 29, also comprising an adjuvant. 

31 . A method of treating a herpes virus infection, comprising administering the vacdne of daim 29. 

32. A vacdne comprising the polynudeotide of daim 2 or daim 7 in a pharmaceutically compatible 
exdpient. 

33. The vacdne of claim 32. which is a live virus or viral expression vector. 

34. A method of treating a herpes vims infection, comprising administering the vacdne of daim 32. 

35. A vacdne comprising the antitxxJy of claim 25 or daim 26 in a pharmaceuticaliy compatible 
exdpient. 

36. A method of treating a herpes vinjs infection, comprising administering ttw vacdne of daim 35. 

37. An oligonucleotide essentially identical to an oligonudeotide selected from the group consisting of 
SEQ. ID NOS:24-63. SEQ. ID NOS:77-78. and SEQ. ID NOS:80-B0. 

38. A method of obtaining an amplified copy of a polynudeotide encoding a Glycoprotein B. 
comprising the steps of 

a) contacting the polynudeotide vwth the oligonudeotide of daim 37; and 

b) elongating oligonudeotide that has formed a duplex with the polynudeotide. 

39. The method of daim 38. wt«rein said amplification reaction is a polymerase chain reaction 
(PGR). 
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40. The method of claim 39. wherein said PGR comprises repeated cycles of annealing and 
elongating, and the annealing is conducted at a temperature of at least 500C. 

41. The method of claim 38. wherein the polynucleotide which is amplified Is first obtained from a 
biological sample taken from an individual affected with a disease featuring fibroblast proliferation 
and collagen deposition. 

42. A method of detecting viral DIMA or RNA in a sample, comprising the steps of: 

a) contacting the DMA or Rh4A in the sample with a probe comprising the polynucleotide of 
claim 2 or claim 7 under conditions that would pemnit the probe to form a stable duplex with 
at least one polynucleotide having a sequence shown in SEQ. ID NOS:1. 3, 92 or 94. but not 
with a polynucleotide having any of the sequences shown In SEQ. ID NOS:5-l3; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

43. A method of detecting viral DMA or RNA in a sample, comprising the steps of. 

a) corrtactirtg the DNA or RNA in the sample with a probe comprising the polynudeotide of 
claim 2 or claim 7 urrier conditions that would pemiit the probe to fonn a stable duplex with 
a polynucleotide having the sequence showvn in SEQ. ID N0:1. and with a polynucleotide 
having the sequence shown in SEQ. ID NO:34, but not with a polynucleotide having any of 
the sequences shown in SEQ. ID NOS:5-13: and 

b) detecting the presence of said stable duplex formed In step a), if any. 

44. A method of detecting viral DNA or RNA in a sample of primate origin, comprising the steps of 

a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 
daim 2 or daim 7 under conditions that would permit the probe to form a stable duplex with 
at least one polynudeotide having a sequence shown in SEQ. ID N0S:1 . 3. 92 or 94. but not 
with a polynudeotide having any of the sequerx^s shown in SEQ. ID N0S:5 or 9-1 3; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

45. The method of daim 42 further comprising conducting an amplification reaction on the DNA or 
RNA of the sample prior to being contacted with the probe. 

46. A method of detecting viral DNA or RNA in a sample, comprising the steps of 

a) conducting an amplification reaction on a polynudeotide in the sample using the 
oligonudeotide of daim 37 as a primer in tt>e reaction; arwJ 

b) detecting the presence of amplified copies of the polynudeotide. if any. 



wo 97/12042 



PCT/liS96/15702 



10 



47. An isolated polynucleotide capable of forming a stable duplex with a second polypeptide 
comprising a sequence selected from the group consisting of SEQ. ID NOS:1. 3. 92. and 94. 
and their respective complementary sequences, under conditions wherein the second 
polynucleotide is capable of forming a stable duplex with at least one polynucleotide having 
a sequence shown in SEQ. ID NO:1. 3. 92 or 94, but not with a polynucleotide having a 
sequence of any of SEQ. ID N0S:5-1 3. 

46. The isolated polynucleotide of daim 47, the nucleotide &equer>ce of which Is contained tn the 
genome of a naturally occurring virus. 

49. An isolated polypeptide comphsing a linear sequence of at least 25 amino acids encoded within 
the polynucleotide of daim 47. 

50 A method for detecting infection of an individual by a herpes virus, comprising detecting viral DMA 
15 or RNA in a biological sample obtained from tfie individual, wtwrein the detecting of viral DMA or 

RNA is by the method of daim 42. daim 44, or daim 46. 

51 . A diagnostic kit for detecting a herpes virus potynudeotide in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises the polynudeotide of daim 2or 

20 daim 7. 

52. A diagnostic kit for detecting a herpes vims polynudeotide in a biological sample, comprising a 
reagent in suitat}le packaging, wt>erein the reagent comprises trie oligonucleotide of daim 37. 

25 53. A method of detecting infection of an individual by a herpes vims, comprising the steps of: 

a) contactirig antitxxty from a sample obtair>ed from the ir>drviduat with the polypeptide of daim 
25 or daim 26 urxjer conditions that pemirt the formation of a stat>le antigen-antitxxly 
complex: and 

b) detecting said stable complexes formed in step a), if any. 

30 

54. A diagnostic kit for detecting an anti-herpesvirus antit)ody present in a biological sample, 
comprising a reagent in suitable packaging, wherein the reagent comprises the polypeptide of 
daim 25 or daim 26. 

35 55. A method of detecting infection of an individual by a herpes virus, comprising the steps of: 

a) contacting a polypeptide from a sample obtair>ed from the individual with the antibody of 
daim 9 or daim 1 0 under conditions that permit the formation of a stable antigerv-antibody 
complex: and 

b) detecting said stable complexes formed in step a), if any. 

no 
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56. A diagnostic kit for detecting a herpes virus polypeptide present in a biological sample, 
comprising a reagent in suitable packaging, wherein the reagent comprises the antibody of claim 
9 or claim 10. 

5 

57. A method of determining whether a pharmaceutical candidate Is useful for treating gamma herpes 
infection, comprising the steps of: 

a) contacting the polypeptide of daim 9 or daim 1 0 with the phanmaceutical candidate; and 

b) detemiining whether a biochemical function of the polypeptide is altered by the 
1 0 phamnaceutical candidate. 

58. The method of daim 57. wherein the biochemical function of the polypeptide determined in step 
b) is the binding of the polypeptide to the surface of a mammalian cell. 

15 59. A method of produdng a Glycoprotein B polypeptide, comprising expressing the polynudeotide of 
daim 2 or daim 7 in a eukaryotic cell. 

60. A method of produdng a polynudeotide comprising a region encoding herpes vims Glycoprotein 
B. comprising replicating the recombinant doning vector of daim 21 or daim 23. 

20 
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Figure 1(a) 



vykknivptifkvrrt'[|]kia 



V Y K K 

(NIVPA> I gt.gtacaagaagaacatcgtgccnt:ayacnccyaa 



N IVPY IFKVRRT[E]K lA 



RFHV 



KSHV ACCTCTGTCACGGTCTACAGGGGCTT^^ 

TSVTVYROHTEgjAITNKYE X) 

cgaactgccccaggcggtagt (<GLTEB) 

{<TNKyB) ggtagtggttgttcatacttg 
gggcttgacagagtccgccat (GLTEA>) 

(YELPA>) acaagtatgaactc- 



60 



(<GhrrEBI gcgccatactgtcttcgtcgt 

(<AAITB) tcgtcgttagtgtttgtttat 
(GMrEA>) gcggtatgacagaagcagcaa 

(KYEIA>t aacaaatatgagatc- 
TSVTVYRa[gTE[5AlTNKYE[Tl 
^CATCTtSTCACGGTCTACCQCOaTATOACAGAAGCAGCAAT^ 120 



120 



RFHV 
KSHV 



RFHV 

KSHV TCCA- 



-cccagg (KY£IA>) 
PRPVPLYEISHMDSTYQCFS 

CCCAGGCCCGTGCCTCTCTACGAGATCAGTCACATGaACACSCACCTACCAGTGCTTTAGT 180 

CCQAOACCCgTGCCACTCTATgAGATAAOCCACATGGACAGCACCT ATCAGTGCTTTAGT 180 

p R P V P t~Y i I S B M D S T Y QC F S 

-ccgagac (YELPA>) 

agacccgtgccacTiccatgarathagycayatgga ( SHMDA>) 

{<CFSSS) acraarrci- 



(<TDRDBI atgcaagcggctagccccact 
SM KflirgvNGVENTFTDRDDVN 
TCCATaAAAATTGTAGTGAACGGAOTCGAAAATACGTrCACCGATCGGGATGACCTAAAC 240 



.TGAAGQTAAATOTCAACOOOQTAQAAAACACATTTACTGACASAGACaATGTTAAC 240 



K[V][gVNOVENTrTDRDDVN 

cttccattcacagtcgcccca (<VNVNB) 

KTFTDB) gtgtaaatgactgtctctgct 
-agntacttctaacaacacttg KCFSSB) 

(EirrFA>) gtcaacggagtagaraayacnTLtyacnaa 
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Figure Kb) 



(<VEGLB) gcagcttccagactgactgtt 
(VEG1A» cccgtcgaaggtctaactgac {PVLYA>) age 

^ccacagrattcctccaaccag (TVnA» (SQPVA» ggtactttagc- 

^ KDNIQB) tgnctrttrtaagtttccacgaaaccg- 

-caaccagtactgtactct (PVLYA>) , ^ » v tr=l t t 

ZIJjl ^^^^I^G^G^CC^C TaGTTTCCCOGCATATACAOAO™ 360 
QPVHll^lEEPGWFPG I Y R V [»J T T 

* ,^«iTTi^i (<TVNCB) tgntgn- 

-cagccggtca (SQPVA>) 

-gtcggtca (<DNIOB) 

VHCEIVDM 
GTAAACTOTaXGATTOTAGACATGTT 
•* «•♦••••♦♦♦••♦•***♦•**♦ 

GTCAACTGTOXGATTGTAGACXTGTT 
VNCE IVDM 

-canttracactctaacatctgtacaa (<TVNCB) 



RFHV 
KSHV 



RFHV 
KSHV 
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Figure 2 



Glycoprotein B 
encoding region 



FRFDA MVPA TVNCA FAYDA PfGKA CYSRA 



FREYA 



TVNCB 



NtDFB NVFDB FREYB 



_: -430 bp 
t -360 Dp 



--600 bp t 



-450 bp * 



-330 DP 



- 1700 bp 



-1060 bp * 



-1600 bp 



* -1000 bp 
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Figure 3fa) 



KSHV 

HVS 

bHV4 

inHV66 

EBV 

hHV6 

WVSl 



S igna 1 -Pept 7 de Doma i n 

KTPRSfehA 

MVPMCHU 
KYYKTllf 
MYP7VKSH 
MTKRRVLS 
MSKMWtF 
HRQGAARG 



KSHV 

HVS 

bHV4 

D1HV66 

EBV 

hHV6 

hHVSl 



TlJGrTVILLVCFCAGAAHS ^GDTFQTSSSPTPPGSSSKAPTXPGEE- 



^AiaKVCSFN 



PSISSTCSTTTSTSKPsfi^ - 



Rwffi^ilTlXC^l^^ HLYVCQPTTLRQPSDKTPAQDAPTFrPPPLSTNTN 

VYVLLAALACRLGAOTPE DPAPPATTVQPTATRQ " ■ 

Crajnr5SwL^^L\ ASAAPSSPGTPGVAAATQAAN 



KSHV 



- ASGPKS VDFYQFRVCS AS I T - GEL 



Muc YQEYFKYRVCSASTT-GELFRFDLDR . 

If Q^^virLXS^gLGAQTPEQ^ 

fflsVl KPKNPPPPRPAmNATVAAGHATLREHLRDIKAEMTOANFY- - -VCPPPTG-ATWOFEOPRRC 



N} VPA> 

RFHVMn VRRYIKIATSVTVYRGM--TEAAITNKYETPRPVPLY 
gpHvEE VRRYRKVATPVTLYRGH- -TDAAITNKYEIPRPVPLY 

KSHV PDTKDKY-HQEGILLVYKKNIVPHIFKVRRYRKIATSVTVYRGL- -TESAITNKYELPRPVPLY 

HVS PSTEDKV-HKEGILLVYKKNIVPYIFKVRRYKKITTSVRIFfHanKEGVAimV^L^ 

bHV4 PDTKDKK - HVEGI LLVLXKN I VPY I FK VRKYRKI ATS VTVYRGW - - SOAAVTNRDOI SRAl P YN 

SSvfifl PsImSc.HVEGILLVraNI^ -TTSSVTGKFEMATOAHEW 

EBV PSFGTRENHTEGLLHVFKD^ ' ^ADSVTNRHEEKFSVDSY 

hHV6 SPYKSNAffisEGFFIIYKTNIEimPVRTYKKELTFQSS -GWYFLDRTVMGLAMPVY 

wis VI PTRPEGQNnEG I AVVFKEN I AP YKFKATWY YKDVTVSQVWFGH - R YSQFMG I FEORAP VPFEE 
*♦ * * * * * 



RFHVMn EISHMDSTYCCFSSHKIVVNGVENTFTDROOVNKTVFLQPVEGLTDNIQRYFSQ- -PVLYSEPG 

RFHVmS EISHmSTYSFSsESciVVNGVEfn'rrGRDOVNKSVFW^ 

KSHV EIsfflsTYSFSsKvNVNGVEKlTTBRDDVMmFLQPVEGLTON^ 



HVl EIDIMDKTYQCHNCMOIEVNGHLNSYYDRDGNNmDUCPVDGLTGAITRYISQ- -PKVF^^P^ 

bHV4 EISMIDRTYS^FSAKTVINGILfn^ 

Sv68 IvOTDSIYSYNSATMVVNNVRQVYVDRDGVr^TVNIRPVDG -ET-^SEPG 

EBV ETDOHDTIySyHAVkJ^^ 

hHV6 eaKlvnshaq*ysavahkrpmtvfsa^^ 

hHSVl raKINAKGfl^RSTAKY^^NLEW 
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Figure 3(b) 



RFHVMn WFPGIYRVG 
KSHv""vfpGIYRVRTTVNCEIVD 

HVS WLVOYRTRTTVNCEI VDMFARSADPYT7FVTALGDTVEVSPFCDVDN5 - • CPNAT- - - - DVLS 
bHV4 WFPGIYRVRmNCEVVDMYARSVEPnWFITALGDTIEISPFj^ 
ShV68 JwPGFYRVRTTVNCEIVDMVARSHDPYNYIATALGDSLELSPFOTFOfa^ 

EBV ffi^IwWRTRmNCLITDMMAKSNSPFDFFmrGQTVEMSPFYD^ RADS 

hHV6 P-LW-YSTSTSLN^VTEATAKAKYPFSYFALTTGEIVEGSPFFrffiSNGKHFAEPLEK-aTIL 
ffisVl RVEAFHRYGTTVNCIVEEVDARSVYPYDEFVLATGDFVYMSPFYGYREGSHTEHTSYAADRFKQ 
* ♦* ♦ * * *** 

KSHV VQVVLNHTVmSDRGTSPTPQNRIFVETGAYTl^WASESICn^AVCPLALWKTFPRSIQTTH^^ 

HVS VOIDLNHTVVDYGNRATSQOHKKRIFAKTUJYSVSWEAVNKSASVCSHVFWKSFQRAIOTEHDL 

bHV4 VWIEENHQ7VDY.ERRGHPTXD!a?IFIJa)EEYTISWKAEDRERAlCDFVI^ 

ShV68 R£VK-NYKFVDYNNRGTAPAGQSRTFI^SATYSWKTATRQTAT&^ 

Sv FHVRTNYKI VDYDNRGTNPQGERRAFLDKGTYTLSWKLENRTA- YbLQHWOTFDSTIATETGK 

hHV6 ENYTMIEDLMNG-MNGATTLVRKIAFLEKADTLF^^ 

hHSVl - VOGFYARDLTTKARATAPTTRN - - LLTTPKFTVAWDWVPKRPS VCTMTKWQE VDEHLRSEGGG 



KSHV 5FHFVANEITATFTAPLTP- • -VANFTDTYSCLTSDIPmTLNA-SKAKLASTHVPNGTVQYFKT 

HVS TYHFIANEITAGFSTVKEP- - -LANFTSDYN^LJm^ImTLED-KlARVN^fTHTPNGTAEYYQT 

bHV4 smFVANEn^FLTSNQEETEIJ^GrfTEILNCHNSTINETliE-TVKKFNICSHIRI^ 

ShV68 SYHFVANEW'ATFWTPLTE---VENFT5TYSe^^ 

EBV SIHVFTDEGTSSFVTMTTV - - -GIELPDAFKCIEEOVNK^HEKYEAVODRmQQEAlTYFrr 

Sv6 TYHFISKELTAAFVAPKES - - - LNLTDPKQTCIKDEFEKI INEVYMSDYNDTYSMNGSYQIFKT 

fflsvi 5FRFSSDAISTTFTTNLTE- - YPLSRVDL60CIGKDAROAHDRIFARRYNATHIKVGQPQYYLA 



KSHV TGGLYLVWQPMSAINL'mAQGDSGNPTSSPPPSASPKTTSASRRK 

HVS EGGMILVWPLIAIELEEAMLEATTO\rrPSAPra^ 

bHV4 NGGLFLIWOAHKPLNLSE KTNYTIERNNKT6NKSRQK- 

f*V68 DGNLYLIWQPLEHPEIEDIDEDSDPEP-TPAPPKSTRRKREAA 

EBV S«5LLU^PLTPRSLATVKNLTELTTPTSSPPSSPSPPAPSAARGSTPAAVLRRRRRDAGNAT 

hHV6 TGDLILIWQPLVOKSL- .HFLEOGSEKIRRRRDVVDVKSRHDI 

fflsVl NKFLlAYQPLLSrfTLAELYVREHLREQSRKPPNPTPPPPGAS 



FAYDA> 

KSHV RRSASTAAAGGGGS-raNLSYTQLQFAYDKLRDGINQVLEELSRAWCREQVRDNLMWYELSKINP 

HVS - - - -RSIRDVSA6SENNVFLS0IQYAYDKLR0SINNVLE£LAI7M;REQVR0TMVWY£IAKINP 

bHV4 - - - -RSVDTXTFOGAKGLSTAOVQYAYDHIJlTSHKHILEELTKTi&EOKKDNLMWY^ 

ShV68 -DNGNS-^EVSKGSENPLlTAOIQFAYDKLTTSVNNVLEELSRAtp 

EBV TPVPPTAPGKSLGTLNNPATVOIQFAYDSLRRQINRMLGDLARAVSLEQKRONMVLRELTXINP 

Sv6 :.:--'vr7^.. LYVQLQYLYDTLKDYINDALGNLAESWCLDQKRTITMLHELSKISP 

ffisvi - • - -anasverikttssiefarlqftynhiqrhvndmlgrvaiaSIelonheltlwnearklkp 
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Figure 3(c) 



IYGKA> CY5RA> 
KSHV TSVMTAIYGRPVSAKFVGDAISVTECINVDQSSVNIHKSLRTNSK DVCYARPLVTFKF 

HVS TSVKTAIYGKPVSRKALGDVISVTECINVDQSSVSIHKSLXTENN DICYSRPPVTFKF 

bHV4 VSVMAAIYa(PVA\^SHGOAFW J^S!!§SEi;XIKE 

mHvSs TSvfSAIYGKPVAARYVGOAISVTDCIYVDQSSVNIHQSU^LQHD KTTCYSRPRVTTFKF 

TTVMSSIYaSvAAKLGDVISVSK^ ETHCYSRPLVSFSF 

hHV6 SrVSEVY^ISAQLHGDVLAIsSlEVTO 

fflsvi KAIASATV^VSARmSoVMAVS^V PG/^YSRPLVSFRY 



tCSHV LNSS-NLFTGQLGARNEIILTNNQVETCKDTCEKYFITRNETLVYmAYLRTIKTTDISUNT 

HVS VNSS-QLFKGQLGARNEILLSESLVENCHOhmFFrAJCNETYHFKNYVHVETLPVNNISTU)T 

bHV4 VNsiATFROTLGTRNEILLTfmiVETCRPTAD^ 

Shv68 inst-dpltwlgprkeiilsntnietJkdesehy 

EBV INDT-mEMLffTONEIFLTXKKTEvjQATSQY^^ 

hHV6 VNSTPEVVPGOLGLDNEILLGD^ 

fflsvi edS-plvegovgennelrl^^ 

**** * * 



<FREYB 

<NWF8 <NVFDB FREYA> 

KSHV FIALNLSFIQNIDFKAIELYSSAEKRLASSVFDLETMFREYNYYT>^RLAGLREDLD^fTIDMNKE 



HVS 
bHV4 

EBV 

hHV6 

hHSVl 



FLAL NLTF I ENIDFKAVELYSSGERKU - NVFDLETMFRE YNYYAOS I SGLRKDFDNSQRNNRD 

flunltfidnidfKvelysetIr^ 

FIALNISFIENIDFmELYSSTERKLASSV 
FISuJtSLIENIOFASLELYSRDEQRAS - NVF^ 

FIRLKIOPLENADFKVLDLYSPDELSRA-NVFDLENILREYNSYKSALYTIEAKI - - - -AANTP 

nDLNITMLEDHEFVPLEVYTRHEIKDSGLLDYTEVQRRNOLHDLRFAOIDTVIH^ 

* * * * * 



Membrane-spanning doma 



KSHV RFVRDLSEIVADLGGIGKTVVNVASSVVTLCGSL\n-6FIN- 



FIKHP jGGHLHIIIVIA 



HVS RIIQDFSEILADLGSIGKVIVNVASGAFSLFGGIVTGILN FKNP ^GMFTFLLIGA 

bHV4 RLVKDLSEMMADLGDIGKVVVMTFSGIVTVF65IVGGFVS FFTNPF^TTii 1 1 TV 

ffiveS RLVQDLSDf«ADLGDIGRSVVNVVSSVVTfFSSIVT6FIK FFTNP 

EBV QFVDGLGEliOSLGSVGQSITNLVSTVGGLFSSLVSGFIS - -FFKNP 

m/e syvnginISSglgaigtglgsvisvtagalgdivggvvs flknp 

ffisvi ihada^wamfaglgaffeghgdlgravgkvvmgivggvvsavsg 



FFTNP LGGVniLLLIV 

Cggifilliigg 
gghlilvlvag 

GGGLMLILAIV 

galavgllvla 



KSHV 

HVS 

bHV4 

ii«V68 

EBV 

hHV6 

hHSVl 



-TREEIKNILLGMHQLQOE 



IILIIFML SRRTKTIAQAPVKMIYPDVDRRAPPSGGAP- • - 

VliLViUVRRTNNMSQAPIRKIYPDVEKSKSTVTP MEPmKQILLGM[5}M0^ 

VVFVVFIV5RRTNNHNEAPIKHIYPNIDKASEQENIQP LPGEEIKRILLGMHQLQQS 

IIFLVWLMRRNSQFHDAP SlL^ 

WlLVISl TRRTR0HSQQPVQHLYPGIDELAQ0HAS6EGPGINP- ISKTELQAIMLALHEQNQE 

vwiiivvfvrqrhvlSo'idmmf 
glaaaffaSyWSlqsnphkalyplttkelknptnpdasgegeegot 
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Figure 3^d) 



KSHV ERQKADDLKKSTPSVFQRTANG • LRQRLRGYKPLTQSLDISPETGE 
HVS AYKKKEEORAARPSIFRQAAETFLR-KRSGYKQISTEDKIV 

hHSVl ALVSAMERTEHKWCKK^^ 
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Figure 4 



EBV GGCGACCTGTTCCGCTTCTCCTCGGACATCCAGTGTCCC 

SHVI GGAGAATTGTTTAGATTTGATTTAGACAGAACTTGTCCA 

mHV68 GGGGAGACCTTCAGGTTTGATTTAGACAAAACATGCCCC 

bHV4 GGAGAACTATTCAGATTTGACCTTGATCAGACATGTCCA 

5 ' -gccgctcagatt:cgacctagaymanmcntgycc-3 ' 
FRFDA 256-fold 33mer> 



Figure 5 



EBV GAGGGCCTGTTGATGGTGTTTAAAGACaUiCATTATTCCCTACTCGTTTAAG 

S HV 1 G AAGG C ATTCTTTTAGTGTACAAAAAAAATATAGTTCC ATATATCTTTAAA 

cnHV6 8 GAGGGCATCTTGCTCGTGTATAAGATCAACATCGTGCCCTACATCTTCAAA 

bHV4 GAAGGCATCCTGCTGGTACTAAAAAAGAATATTGTCCCATACATCTTCAAA 

5 ' -gtgCacaagaagaacatcgtgccncayatnttyaa-3 ' 
NIVPA 64 -fold 3 2mer> 

5 ' -gtgtacaagaagaacatcgtgcc- 3 ' 
NIVPASO 23mer> 



Figure 6 



EBV 
sHVl 
mHV68 
bHV4 



ATATGGACrn-ACAGAACAAGAACrrACCGTCAACTGCCTGATAACTGACATGATGGCCAAG 
TGGGGAACTTACAGGACTCGAACTACCGTTAACTGTGAAATTGTAGACATGTTTGCTAGG 
CCTGGCTTTTATCGTGTTCGAACCACCGTTAACTGTGAAATTGTAGACATGGTGGCACGC 
CCAGGGATTTATAGAGTGAGAACAACTGTTAATTGTGAGGTTGTTGACATGTATGCCCGC 



5 ' - tggactcacaggactcgaacnacngcnaaytg- 3 ' 
TVNCA 128-fold 32mer> 



3 ' -tgntgncanctracactctaacatctgtacaa-S ' 
cTVNCB 12e-fold 32mer 



3 ' -acactctaacatctgtacaa- 5 * 
<TVNCBSQ 20mer 
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Figure 7 



EBV AACAATCCCGCCACCGTCCAGATCCAATTTGCCTACGAC 

SHVI AATAATGTGTTTCTATCACAAATACAATATGCATATGAT 

mHV6 8 AATCCGCTCATTACGGCCCAAATTCAATTTGCCTATGAC 

bHV4 AAGGGCCTGTCCACTGCCCAGGTTCAATATGCCTATGAC 



5 ' -aataacctctttacggcccaaattcartwygcncayga- 3 ' 
FAYDA 64 -fold 3 8mer> 



Figure 8 

AATCCAACCACCGTCATGTCCAGCATCTACGGTAAGGCGGTG 
AATCCAACAAG TGTTATG ACAG CAATATATGG AAAACCTGTC 
AACCCTACGAGTGTGATGTCTGCCATTTATGGAAAGCCTGTC 
AACCCAGTGAGTGTCATGGCAGCCATTTATGGGAAACCTGTG 

5 ' -ccaacgagtgcgatgccagccacttayggnaarccngt- 3 ' 
lYGKA 64 -fold 3Bmer> 

5 * -ccaacgagtgtgatgt:cagcc-3 ' 
lYGKASQ 2lmer> 



Figure 9 



EBV TGCTACTCGCGCCCCCTGGTGTCCTTCAGCTTTATCAACGAC 

SHVl TGCTATTCACGGCCTCCAGTTACATTTAAATTTGTTAACAGT 

mHVe a TGCTACTCGAGACCTAGAGTCACCTTCAAATTTATAAACAGT 

bHV4 TGTTACTCCAGACCCCTGGTCACATTTAAATTTGTGAATAGT 



5 ' -tgctactcgcgaccictagtcaccttyaartcyrcnaa- 3 ' 
CYSRA 64 -fold 38mer> 

S ' - tgctactcgcgacccctagtcacc-3 ' 
CYSRASQ 24mer> 
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Figure 10 



EBV ■ AACAT^GACTTTGCCTCCCTGGAGCTGTACTCACGGGACGAACAGCGT 

sHVl AATATTGACTTTAAAGCTGTTGAATTGTATTCAAGTGGAGAGAGAAAG 

mHV6 S AATATCGACTTCAAAACAGTAGAACTGTACTCCTCTACTGAAAGGAAA 

bHV4 AATATAGATTTCAAGACAGTGGAACTTTACAGTGAGACTGAAAGAAAG 



3 ' -ccrtadctraarttycgtcaccctgacatgaggcca-s • 
<NIDFB 4e-fold 36mer 

3 ' -cgccaccctgacacgaggcca-5 ' 
<NIDFBSO 2lmer 



Figure 11 



EBV AACGTCTTTGACCTGGAGGGCATCTTCCGGGAGTACAACTTCCAGGCGCAAAAC 

s HVl AACGTGTTTGATTTAGAGACTATGTTTAGAGAATATAACTATTACGCTCAGAGT 

mHVfi e AGCGTCTTTGATATAGAATCCATGTTTAGGGAATATAACTATTACACCTACAGC 

t>HV 4 AGTG CCCTCGACCTGG AGACGATGTTTAG AGAGTATAATTACTAC ACACAG AAG 



5 ' -CCtgacctggagactatgtCymgngartayaa-a • 
FREVA 64 -fold 32mer> 

3 ' -CacaartcyccyacrccgaLgatgtgggccLcg-S ' 
<FREYB 16-folc 33mer 

3 * - tcrcanaarccrgacctctcgcacaaggctcc- 5 ' 
<NVFDB 32 -fold 3 2mer 



Figure 12 



EBV TTCATCTCCTTCTTCAAAAACCCCTTCGGCGGCATGCTC 

SHVl ATATTAAATTTTATTAAAAATCCTTTAGGTGGCATGTTC 

mHV6 8 TTCATTAAATTCTTTACCAACCCTCTAGGGGGAATATTC 

bHV4 TTTGTCAGTTT I'l'l'CACAAAC CCC ATTGGGGGCGTGACG 



5 ' -acctccatcaaaaatccctcnggnggnatgyt - 3 ' 
GGMA 128-fold 32mer> 
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Figure 13(a) 



VRRYRKIATSVTVYRGLTE 

KSHV GGTGCGGCGCTATAGGAAAATTGCCACCTCTGTCACGGTCTACAGGGGCTTGACAGAG- - 

RFHV .A. .CA.G. .C.TA A. .A. .A C . C . . TA A- - 

bHV4 A. - .A. ,AAA A A. .G.-A. .T A . . G . G . T . CC . , - - 

SHVI A. .CA.AA.A. . C . AA CA.A. .A. .A. . . CGTA . T . TT . AT G. . - TAGAGA 

eHV2 T- .CA.AAAG G. . CATG GAC . . . CA A. . ,T.G. . GC , . . 

mHV68 AA.CA. .A. A A. .A. .AATT. .TCAAC.G. .CA. . .GGC.A-. .CA. .CACT-- 

hEBV , . .C. .CTC. . .C.CC. .G. .A-TG. . .AACA.TCTCA AT G . TAG . C . - - 



KSHV 
RFHV 
bHV4 
aHVl 
eHV2 

mHvee 

hEBV 



SAITNKYELPRPVPLYE IS 

tCCGCCATCACCAACAAGTATGAACTCCCGAGACCCGTGCCACTCTATGAGATAAG 

G.A. .A A A GA C..G T C C. 

G.A. .TG.T T.G.G. . - .TA. .AGC. . .G. .A. A. .CTATA. . . .A. . TTC 

AGGTGTT. .T. .T. .A A.GG TT.T. . .G.T. .T. . .AAA GA 

GAT A..A...C..C. CACGAGGAGCTACG ....C..C..G..C..-G. CCA 

AGTT.AG. - . .TGGT. .A.T. . . . A . GG . C . CT . AG . CC . ACGAG . GG . . AG . GG . 

GA.T. .G.G CG.C.C. . GGAGAA . TTCT . . - . TGACAG . . -C, .A.CTGA 



HMDSTYQCFSSMKVNVNGVE 

KSHV CCACATGGACAGCACCTATCAGTGCTTTAGTTCCATGAAGGTAAATGTCAACGGGGTAGA 

UPjjy T C AA. TGTA ..G..C..A..C.. 

bHV4 AATG- "a. .T. .G T. .T. . CTC . G . T . . . GCAAC . GTCA . T . .T. - . A . TCT 

SHVI TATT T.AG. .T. .C. .A. .TCAT.A. .G C. .A. .G.A. .A AA . GTT 

eHV2 GATG CA.TAT CG . .G.AC C. . C . AGGGG . . . CAC . T 

mHV68 .G. .T.T T AC . A . AG . GCC . CCA . GGTG . .A. . .AAC. . CAG 

hEBV ...G T.C..T...C AC.ACG.GG.C. . . A . G . CAAAAG . T . . . C . GAC 
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Figure 13fb) 



NTFTDRDDVNTTVFLQPVEG 

KSHV AAACACATTTACTGACAGAGACGATGTTAACACCACAGTATTCCTCCAACCAGTAGAGGG 

RFHV ...T..G..C..C..TC.G..T-.C..A AA..C T G..C..C..A. . 

bHV4 G C.A. -TA G. . TTC . .AA. . T . AGT . T . . TCC G G.CC. . 

3HV1 . . .TT.T-ACTA T.GAAA AA. .T. . .GA.T.AA.G. .T T. . 

eHV2 C C.ACTA G. . . .GGTGG. . .GAG. .G.CC A. . . .G.CC. .T. . 

mHV68 .C.GGTG.A-GTG T.GG. -C. .T.AA, .T. .GAA.A.A.GC. -T. .T. -T. . 

hEBV GCG.GTG.A.GTA- . . C . C . . . . G A T. . .C. .CAA. . .AA.G. . CACC . G . . . 

LTDNIQRYFSQPVIYAEPGW 

KSHV GCTTACGGATAACATTCAAAGGTACTTTAGCCAGCCGGTCATCTACGCGGAACCCGGCTG 

RFHV T..A..T..C A A A. .A. .AC.G. ..T.T A.. 

bHV4 A.-G..T..G..- .TAA.C. .A T..T..A..TC....A. .T..A T 

sHVl T. .A. . , .G.GCA.T.AC. ..A...A.T A. .TAAAG.T.TT. .T..T..T 

eHV2 T . . C . . CTC . . G - . TAACGC . C . . TCAG . . T . . A . . A . AGG . G CACC . . . A . AAA 

mHV6B T..A..A.GG..T..C A T CAC.C.T. .TT.A T..T.. 

hEBV C. .GG.CA.CGGGG.G-GCC.C. . .GCC A. . . AGC . . . .T.AC.CC G. . 

FPGIYRVR 

KSHV GTTTCCCGGCATATACAGAGTTAGG 

RFHV ...C..A..T..C G...G.. 

bHV4 A..G..T..T G..A 

SHVI . C - ATGG . . A . CT GAC . C . A 

eHV2 CC . GTTGT . GTCT .... C - ACA . . A 

mHV6B .A.G..T...T.T..TC.T...C.A 

hEBV . . . GATAT . G . CT ACA . . A 
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Figure 14 



KSHV VRRYRKIATSVTVYRGLTES - - AITNKYELPRPVPLYEISHKDSTYQCFSSMKVNVNGVE 

RFHV I M. .A-- I IV 

bHV4 . .K WSQA-- .V. . RDDIS . AI . YN . . .MI,R. .H. , .A.ATVI. . IL 

sHVl . . . .K. ,T. . .RIFN.W.REGV W. .S .A. .K. . .DIM.K. . . .HNC.QIE. . .ML 

mHV68 I I.QL.IW. . . . T . - -SV . G . F . MATQAHEW . VGDF . ,1. . .YN.ATMV. .N.R 

eHV2 . .K. . , .M. .T.I.K.WS.D- - . . . .QHTRSYA VQM. . HY AVQ. .EG.HV 

hEBV . -S.T. .V.NILI.N.WYAD--SV. .RH.EKFS.DS. .TDQ. .TI. . . YNAV . MTKD . LT 

VTVYRG ATTWKYE SHMDSTY V£- 

rgltesa 
rgmceaa 

KSHV NTFTDRDDVNTTVFLQPVEGLTDNIORYFSQPVIYAEPGWFPGIYRVR 

RFHV K L.S G 

bHV4 . .YI. , .SE.KS.P. , . .A. . .E. .N L 

SHVI .SYY, . .GN.K. .D.K. .D. . .GA.T. .1. . ,KVF.D. . .LW.T. .T. 

mHV6 8 QVYV . . . G . . K . . NIR ..D...G TL.S....M..F.... 

eHV2 . - YY. . .GW.E.A. .K.AD. , .SS.T. .Q. . .EV. .T. RNLLWS . XT . 
hEBV RVYV . . . G . . I . - N , K . TG . . ANGVR . . A . . TEL . DA . . . LIWT . . T . 

-HTFTD rVFLOPV TDNIQRY pviyaep 

pvlysep 
RYFSQP 

{-) Deletions 

{ . } Amino acids identical to KSHV 
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Figure 15 



KSHV VRRYRKIATSVTVYRGLTES— AITNKYELPRPVPLYEISHMDSTYQCFSSMKVN 

RFHV VRRYIKIATSVTVYRGMTEA— AITNKYEIPRPVPLYEISHMDSTYQCFSSMKIV 

bHV4 VRKYRKIATSVTVYRGWSQA—AVTNRDDISRAIPYNEISMIDRTYHCFSAMATV 

sHVl VRRYKKITTSVRIFNGWTREGVAITNKWELSRAVPKYEIDIMDKTYQCHNCMQIE 

eHV2 VRKYRKIMTSTTIYKGWSED—AITNQHTRSYAVPLYEVQMMDHYYQCFSAVQVN 

mHV68 IRRYRKIITQLTIWRGLTTS— SVTGKFEMATQAHEWEVGDFDSIYQCYNSATMV 

hEBV VRSYTKIVTNILIYNGWYAD— SVTNRHEEKFSVDSYETDQMDTIYQCYNAVKMT 

hCMV VRVYQKVLTFRRSYAYIHTT— YLLGSNTEYVAPPMWEIHHINSHSQCYSSYSRV 

hHHV6 VRTYKKELTFQSSYRDVGW— YFLDRTVMGLAMPVYEANLVNSHAQCYSAVAMK 

hVZV ATVYYKDVIVSTAWAGSSYT-QITNRYADRVPIPVSEITDTIDKFGKCSSKATYV 

SHVSA8 ATMYYKDVTVSQVWFGHRYS-QFMGIFEDRAPVPFEEVMDKINAKGVCRSTAKYV 

hHSVT ATMYYKDVTVSQVWFGHRYS-QFMGIFEDRAPVPFEEVIDKINAKGVCRSTAKYV 



KSHV v-NGVENTFTDRDDV-NTTVFLQPVEGLTDNIQRYFSQPVIYAEPGWFPGIYRVR 

RFHV v-NGVENTFTDRDDV-NKTVFLQPVEGLTDNIQRYFSQPVLYSEPGWFPGIYRVG 

bHV4 i-NGILNTYIDRDSE-NKSVPLQPVAGLTENINRYFSQPLIYAEPGWFPGIYRVR 

SHVI V-NGMLNSYYDRDGN-NKTVDLKPVDGLTGAITRYISQPKVFADPGWLWGTYRTR 

eHV2 E-GGHVNTYYDRDGW-NETAFLKPADGLTSSITRYQSQPEVYATPRNLLWSYTTR 

mHV68 V-NNVRQVYVDRDGV-NKTVNIRPVDGLTGNIQRYFSQPTLYSEPGWMPGFYRVR 

hEBV K-DGLTRVYVDRDGV-NITVNLKPTGGLANGVRRYASQTELYDAPGWLIWTYRTR 

hCMV i-AGTVFVAYHRDSYENKTMQLMPDDYSNTHSTRYVTVKDQWHSRGSTW-LYRET 

hHHV6 RPDGTVFSAFHEDNNKNNTLNLFPLNFKSITNKRFITTKEPYFARGPLW-LYSTS 

hVZV R-NNHKVEAFNEDKN-PQDMPLIASKYNSVGSKAWHTTNDTYMVAG-TPGTYRTG 

SHVSA8 R-NNMESTAFHRDDH-ESDMALKPAKAATRTSRGWHTTDLKYNPARVEAFHRYGT 

sHSVi H-NNLETTAFHRDDH-ETDMELKPANAATRTSRGWHTTDLKYNPSRVEAFHRYGT 
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Figure 16 



Gamma 



Beta 



Alpha 



- KSHV 

- RFHV 

- bHV4 

- sHVl 

- eHV2 

- mHV68 

- hEBV 

- hCMV 

- hHHV6 

- hVZV 

- SHVSAB 

- hHSVl 
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Figure 17(a) 



KSHV 

RFHV 

bHV4 

sHVl 

eHV2 

mHV68 

hEBV 



GGTGCGGCGCTATAGGAAAATTGCCACCTCTGTCACGGTCTACAGGGGCTTGACAGAG-- 

A CA G C TA A. .A. .A C . C . . T A A- - 

-A- -AAA A A..G..A..T A . . G . G . T . CC - . - - 

A CA AA.A. .C.AA CA.A. -A. .A. . . CGTA . T . TT . AT G. . . TAGAGA 

T* CA.AAAG G. .CATG GAC. . .CA. A . . . T, G . . GC . . . - - 

AA.CA. .A. A A, -A. .AATT. .TCAAC.G. .CA. . .GGC.A. . .CA. .CACT-- 

. . . C. . CTC . . . C. CC . . G . . A . TG . . . AACA . TCTCA . . 



. AT . . - . G . TAC . C . - - 



KSHV 

RFHV 

bHV4 

sHVl 

eHV2 

TnHV6 8 

hEBV 



XCCGCCATCACCAACAAGTATGAACTCCCGAGACCCGTGCCACTCTATGAGATAAG 

-.--G.A..A A A GA....C..G T C C. 

G^A. .TG.T T.G.G TA. .AGC. . .G. .A, A. . CTATA A. . TTC 

AGGTGTT. .T. .T. .A A.GG TT.T. . .G.T. -T. . .AAA GA 

GAT A. .A. . .C. . C. CACGAGGAGCTACG . ...C..C..G,.C. - . G . CCA 

AGTT.AG TGGT. .A.T A . GG . C . CT . AG . CC . ACGAG . GG . . AG . GG . 

GA.T. .G.G CG. CC. . GGAGAA . TTCT . . . . TGACAG . . - C . A . CTGA 

{SHMDA>) agacccgtgccactctatgarathag- 



KSHV 

RFHV 

bHV4 

sHVl 

eHV2 

TtiHVee 

hEBV 



CCACATGGACAGCACCTATCAGTGCTTTAGTTCCATGAAGGTAAATGTCAACGGGGTAGA 

^ C AA.TGTA. .G. .C -A. .C . 

AATG* !a" 't. -G T. -T. .CTCG.T. . . GCAAC GTCA . T . .T. . .A.TCT 

T ATT T . AG . . T . . C . A . . TCAT .A. -G. . . .C .A. .G.A. .A AA. GTT 

GATG CA.TAT CG . .G.AC C . C. AGGGG . . . CAC T 

,T.T T AC A. AG. GCC CCA. GGTG. .A. . .AAC . CAG 

*_'g.!...T.C.T...C ACACG.GG.C . . A . G . CAAAAG . T . . . C GAC 



ycayatgga 



(ENTFA>) gtcaacggagtaga- 
( cCFSSB) acraartcragntacttctaacaacacttg 
(SHMDA>) 
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KSHV 

RFHV 

bHV4 

SHVI 

eHV2 

mHV6B 

hEBV 



AAACACATTTACTGACAGAGACGATGTTAACACCACAGTATTCCTCCAACCAGTAGAGGG 

. . .T. .G. .C. .C. .TC.G. -T. .C. .A AA. .C T G. .C. -C. - A. . 

g/. /. .C.k. .TA G. .TTC. -AA. .T.AGT.T. . TCC G G . CC - . 

. TT . T . ACTA T . GAAA AA. -T. . .GA.T.AA.G. .T T. . 

c" " . . !c!aCTA G GGTGG. . .GAG. .C.CC A G . CC . .T. . 

C GGTG.A.GTG T.GG. .C. .T.AA. .T. .GAA.A.A.GC. .T. .T. .T. . 

GCG.GTG.A.GTA. . .CC GA T. . .C. . C AA . . . AA . G . . CACC - G . . . 



-raayacnttyacnga {ENTFA>) 



KSHV 

RFHV 

bHV4 

sHVl 

eKV2 

mHV6 8 

hEBV 



GCTTACGGATAACATTCAAAGGTACTTTAGCCAGCCGGTCATCTACGCGGAACCCGGCTG 

T.,A..T..C A A A..A..AC.G...T,T A.. 

.gWi. .G TAA.C. - A T. .T. .A. .TC A. .T. .A T 

.A G.GCA.T.AC. . .A. . .A.T A. . TAAAG . T . TT . .T..T..T 

C . . CTC . . G . . TAACGC . C . . TCAG . . T . . A . . A . AGG . G CACC . . . A . AAA 

!a' !a.GG. .T. .C A T CAC.C.T- -TT.A T. .T.. 

.GG.CA.CGGGG.G.GCC-C. . .GCC A. . .AGC. . . .T.AC.CC G. . 



tgnctrttrtaagtttccatgaaatcggtcggtca (<DNIQB) 



KSHV 

RFHV 

bHV4 

sHVl 

eHV2 

mHV6 8 

hEBV 



GTTT C C CGG CATAT ACAG AGTTAGG 

. . .C. .A. .T. .C G. . .G. . 

A. .G. .T. .T G. .A 

.C.ATGG. .A.CT GAC.C.A 

CC. GTTGT . GTCT . . . . C , ACA . .A 
.A.G- .T. . .T.T. .TC.T. . .C.A 
. . . GAT AT . G . CT ACA . . A 
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Capsid/Maturation/Transport Gene ^ ^ ^ , ^ _ . . p 
G GMFPIQKMMVSEMIWPSIE 

TGGGGGCATGmCCCATTCAAAAGATCATGGTATCy^TOT^ 

RKDWIEPNFNQFYSFENQDI 
GCGGAAGGACTGGATAGAKCCAACTTCAACCAGT^ 

NHLQKRAWEYIRELVLSVSL 
AAACCATCTGCAAAAtyUiAGCnGGGAATATATCAGAGA^ 

NNRTWERELKILLTPQGSPG 
GAACAACAGAACrrGGGAGAGGGAGCTAAAAATAmCTCACGCCTO^^ 

FEEPKPAGLTTGLYLTFEIS 
GTTTGAGGAACCGAAACCCGCAGGACTCACAACGGGGCTGT ArrTMrATnfiAGATATC 



60 
120 
180 
240 
300 



APLVLVDKKYGWIFKDLYAL 

I YHHLQLSNHNOSQV* Glycoprotein B Gene 

GTVILLVCFCAGAAHSRGDT 
TGGGGACTGTCATCCTGTTGGTaGCTmGCGCAGGCGCGGCGCACTCGAGGGGTGACA 480 

F0T5SSPTPPGSSSKAPTKP 
CCmCAGACGTCCAGnCCCCCACACCCCCAGGATCTTCCTaAAGGCCCCCACCAA^^ 540 

GEEASGPKSVDFYQFRVCSA 
CTGGTGAGGAAGCATaGGTCCTAAGAGTGTGGACTTTTACCAGTTCAGAGTGTGTAGTG 600 

SITGELFRFNLEQTCPDTKD 
CATCGATCACCGGGGAfiCTTmCGGmAACaGGAGCAGACGTGCCCAGACAC^^ 660 

KYHQEGI LLVYKKNIVPHIF 
ACAAGTACCACCAAGAAGGAATTTTACTGGTGTACAAAAAAAACATAGTGCCTCATATCT 720 

KVRRYRKIATSVTVYRGLTE 
TTAAGGTGCGGCGaATAGGAAAATTGCCACCTCTGTCACGGTCTACAGGGGCTTGACAG 

SAITNKYELPRPVPLYEI5H 
AGTCCGCCATCACCAACAAGTATGAACTCCCGAGACCCGTGCCACTaATGAGATAAGCC 



HDSTYQCFSSMKVNVNGVEN 
ACATGGACAGCACCTATCAGTGCTTTAGTTCCATGAAGGTAAATGTCAACGGGGTAGAAA 
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780 



840 



900 
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Figure 19fb) 



TFTDRDDVNTTVFLQPVEGL 
ACAaTTTACTGACAGAGACGATGTTAACACCACAGTATTCCTCCAACCAGTAGAGGGGC 



PGIYRVRTTVNCEIVDMIAR 
TTCCCGGCATATACAGAGnAGGACCAaGTCAATTGCGAGATAGTGGACATGATAGCCA 

> C 



QVVLNHTVVTYSDR6TSPTP 
TCCAAGTAGTTCTCAACCACACTGTGGTCACGTACTaGACAGAGGAACCAGTCCCACTC 



TTAVCPLALWKTFPRSIQTT 
AGACCACGGCCGTGTGTCCGaGGCACTGTGGAAAACCrrCCCGCGCTCCATCCAGACTA 

HEDSFHFVANEITATTTAPL 
CCCACGAGGACAGCTTCCACTTTGTGGCCAACGAGATCACGGCCACCTTCACGGaCCTC 

TPVANFTDTY5CLTSDINTT 
TAACGCCAGTGGCCAACTTTACCGACACGTACTCTTGTCTGACCTCGGATATCAACACCA 

LNASKAKLASTHVPNGTVQY 
CGCTTAACGCCAGCAAGGCCAAACTGGCGAGCACTCACGTCCCTAACGGGACGGTCCAGT 

FHTTGGLYLVWQPMSAINLT 
ACrrCCACACAACAGGCGGACTCTATTTGGTCTGGCAGCCCATGTCCGCGATTAACCTGA 

HAQGDSGNPTSSPPPSASPH 
CTCACGCTCAGGGCGACAGCGQ6AACCCCACGTCATCGCCGCCCCCCTCCGCATCCCCCA 

TTSASRRKRRSASTAAAGGG 
TGACCACCTCTGCCAGCCGCAGAAAGAGACGGTCAGCCAGTACCGCTGCTGCCGGCGGCG 

GSTONLSYTQLQFAYDKLRD 
GGGQGTCCACGGACAACCTGTCTTACACGCAGCTGCAGnTGCCTACGACAAACTGCGGG 

GINQVLEELSRAWCREQVRD 
ATGGCATTAATCAGGTGTTAGAAGAACTCTCCAGGGCATGGTGTCGCGAGCAGGTCAGGG 
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960 



TDNIQRYFSQPVIYAEPGWF 
TTACGCy^TAACATTCAAAGGTACTTTAGCCAGCCGGTCATCTACGCGGAACCCGGCTGGT 1020 



1080 



SAEPYNYFVTSLGDTVEVSP 
GGTCTGCTGAACCATACAATTACmGTCACGTCAaGGGTGACACGGTGGAAGTCTCCC 1140 

FCYNESSCSTTPSNKNGLSV 
CTTmGCTATAACGAATCCTCATGCAGCACAACCCCCAGCAACAAAAATGGCCTTAGCG 1200 



1260 



QNRIFVETGAYTLSUASESK 
CCCAAAACAGGATCmGTGGAAACGGGAGCGTACACGCTTTCGTGGGCCTCCGAGAGCA 1320 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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Fif|iire 19rc> 



NLHWYELSKINPTSVMTAIY 
ACAACCTAATGTGCTACGAGCTCAGTAAAATCMCCCCACCAGCGTTATGACAGCCATC^ 1920 

GRPVSAKFVGDAI5VTECIN 
ACGSTCGACCTGTATCCGCCAAGTTCGTAGGAGACGCCAmCCGTGACCGAGTGCA 1980 

VDQSSVNIHKSLRTNSKDVC 
ACGTGGACCAGAGCTCCGTAAACATCCACAAMCCTCAGAACCAAT^^ 2040 

YARPLVTFKFLNSSNLFTGQ 
(HTACGCGCGCCCCCTGGTGACGmAAGTTmGAACAGTTCCAACCTATTCACC^ 2100 

LGAR NEIILTNNQVETCKDT 
AGCTGGGCGCGCGCAATGAGATAATAaGACCAACAACCAGGTGGAAACCTGCAAAGACA 2160 

CEHYFITRNETLVYKDYAYL 
CCTGCGAACACTACTTCATCACCCGCAACGAGACTCTGGTGTATAAGGACTACGCGTACC 2220 

RTINTTDISTLNTFIALNLS 
TGCGCACTATAAACACCACTGACATATCCACCCTGAACACrnrrATCGCCCTGAATCTAT 2280 

FIQNIDFKAIELYSSAEKRL 
CCTTTAnCAAAACATAGACTTCAAGGCCATCGAGCTGTACAGCAGTGCAGAGAM 2340 

ASS VFDLETMFREYNYYTHR 
TCGCGAGTAGCCTGTTTGACCTGGAGACGATGTTCAGGGAGTACAACTACTACACACATC 2400 

LAGLREDLDNTIDMNKERFV 
GTCTCGCGGGmGCGCGAGGATCTGGACAACACCATAGATATGAACAAGGAGCGCTTCG 2460 

RDLSEIVADLGGIGKTVVNV 
TAAGGGACTTGTCGGAGATAGTGGCGGACCTGGGTGGCATCGGAAAAACGGTGGTGAACG 2520 



ASSVVTLCGSLVTGFINFIK 
TGGCCAGCAGCGTGGTCACTCTATGTGGCTCATTGGmCCGGAmATAAATTTTATTA 2580 

HPLGGHLHIIIVIAIILIIF 
AACACCCCaAGGTGGCATGCTGATGATCATTATCGrrATAGCAATCATCCTGATCATTT 2640 

HLSRRTNTIAQAPVKMIYPD 
TTATGCTCAGTCGCCGCACCAATACCATAGCCCAGGCGCCGGTGAAGATGATCTACCCCG 2700 

VDRRAPPSGGAPTREEIKNI 
ACGTAGATCGCAGGGCACCTCCTAGCGGCGGAGCCCCAACACGGGAGGAAATCAAAAACA 2760 
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Figure 19(d) 



LLGMHOLQQEERQKADDLKK 
TCCTGaGGGAATGCACCAGaACAACAAGAGGAGAGGCAG^ 2820 



STPSVFQRTANGLRQRLRGY 
AAAGTACACCCrrCGGTGmCAGCGTACCGCAAACGGCCTOGTCAGCGTCTGAGAGGA 2880 

> L 

KPLTQSLDISPETGE* 
ATAAACCTCTGACTCAATCGCTAGACATCAGTCCGGAAACGGGGGAGTGACAGTGGATTC 2940 

> T 

G«5CTTATT6mGATGTAAAmAGGAAACACGGCCCGCCTCTGAAGaCCACA^^ 3000 

DNA Polymerase Gene 
H D 

ACTGCAGnATCAACCCTACTCGTTGCACACAGACACAAAmCCGTCCGCAGATCA^^ 3060 

FFNPFIDPTRGGPRNTVRQP 
ATrrTTTCAATCCAmATCGACCCAACTCGCGGAGGCCCGAGAAACACTGTGAO 3120 

TPSQSPTVPSETRVCRLIPA 
CCACGCCGTCACAGTCGCCAACTGTCCCaCGGAGACAAGAGTATGCAGGCTO 3180 

CFQTPGRP6VVAVDTTFPPT 
CCTGmCCAAACCCCGGGGCGACCCGGCGTGGTTGCCGTGGACACCACATTTCCACCCA 3240 

YFQ6PKRGEVFAGETGSIWK 
CCTACTTCCAGGGCCCCAAGCGGGGAGAAGTATTCGCGG6AGAGACTGGGTCTATCTGGA 3300 

TRRGQARNAPMSHLIFHVYD 
AAACAAGGCGCGGACAGGCACGCAATGCTCCTATGTCGCACCTCATATTCCACGTATACG 3350 

IVETTYTADRCEDVPFSFQT 
ACATCGTGGAGACCACCTACACGGCCGACCGaGCGAGGACGTGCCArnAGCTTCCAGA 3420 

DI I'PSGTVLKLLGRTLDGAS 
CTGATATCArrCCCAQCQGCACCGTCCTCAAGCTGaCGGCAGAACACTAGATQGCGCCA 3480 

VCVNVFRQRCYFYTLAPQGV 
(TrCTCTGCGTGAACGTTTTCAGGCAGCGaGCTACTTCTACACACTAGCACCCCAGG^ 3540 

NLTHVLQQALQAGFGRA5CG 
TAAACCT6ACCCACGTCCTCCAGCAGGCCCTCCAGGCTGGCTTCGGTCGCGCATCCTGCG 3600 



GCTTCTCCACCG 
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Fiq.in. 20 
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Figure 23 



TPVTLYRGHTDAAITNKYEI 
Arr^rrfiTrj^CAC TrTArrr^rvrrATCArAfiArGCAGCAATAACTAACAMTAT 



SHKIVVNGVENTFTGRDDVN 
Trr.ATf;AAAATTnTAP;Tr^rf;(^ nTrtUAAACACGnrACCGGTCGGGAT^^ 
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VYKKNIVPNHFKVRRYRKVA 
GTGTACAAGA^q*^^*'r'"^f^''TAArJiTfnTrAAr^.TArr^AGGTACAGAAAAGT^^ 60 



120 



PRPVPLYEISHMDSTYQCFS 
rrrArj^rrrCTACC^rTATArr^r^Tr ACTrArATGGACAr^ACmcCAGTGCTTlAG^ 180 



240 



KSVFLQPVEGLTDNIKRYFS 
AAAAr^aTATTTCTrrAr^rAGTCGAA r^rrAAfTGACAArATAAAGAGATACI^^ 300 



QPVLYSEPGWFPGIYRVRTT 
rAfyrACTGCTATATTrTfiAArrXr^.ATnfTrTTCCAy .TATrTACAr^^^ 360 



V N C E I V D H 

6TTAATTGT6AGATTGTAGACATGTT 380 
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